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“Yes,  yes,  I  know  that,  Sidney  ...  everybody  knows  that!  ...  But  look: 

Four  wrongs  squared,  minus  two  wrongs  to  the  fourth  power,  divided  by 
this  formula,  do  make  a  right.”  Gary  Larson 
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Abstract  ^ 

A  method  is  proposed  for  designing  multiple  model  adaptive  estimators- to 
provide  combined  state  and  parameter  estimation  in  the  presence  of  an  uncertain 
parameter  vector.  It  is  assumed  that  the  parameter  varies  over  a  continuous  region 
and  a  finite  number  of  constant-gain  filters  is  available  for  the  estimation.  The  esti¬ 
mator  elemental  filters  are  chosen  by  minimizing  a  cost  functional  representing  the 
average  state  prediction  error  autocorrelation,  with  the  average  taken  as  the  true  pa¬ 
rameter  ranges  over  the  admissible  parameter  set.  A  second-order  example  is  used  to 
illustrate  the  increase  in  performance  over  previously  accepted  filter  selection  meth¬ 
ods.  By  minimizing  a  cost  functional  representing  the  average  parameter  prediction 
error  autocorrelation,  a  parameter  estimator  is  designed  which  is  different  from  the 
state  estimator.  The  parameter  estimator  found  with  this  method  provides  lower 
average  mean  square  parameter  estimation  error  than  previously  accepted  design 
methods. 

An  analogous  method  is  proposed  for  designing  multiple  model  adaptive  con¬ 
trollers  to  provide  stabilizing  control  in  the  presence  of  an  uncertain  parameter  vec¬ 
tor.  A  finite  number  of  constant-gain  controllers  is  used  to  regulate  a  system  with  a 
parameter  vector  that  varies  over  a  continuous  region  of  the  parameter  space.  The 
controller  elemental  filters  are  chosen  by  minimizing  a  cost  functional  representing 
the  average  regulation  error  autocorrelation,  with  the  average  taken  as  the  true  pa¬ 
rameter  ranges  over  the  admissible  parameter  set.  A  second-order  example  is  used 
to  demonstrate  the  improvement  in  performance  over  previously  accepted  controller 
selection  methods.  Modifications  to  the  estimator  and  controller  optimization  algo¬ 
rithms  are  investigated  which  provide  the  additional  capabilities  of:  changing  the 
filter  selection  mechanization,  using  different  types  of  cost  functionals,  and  placing 
lower  bounds  on  the  model  probabilities  to  keep  the  system  open  for  adaptation. 
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AN  OPTIMAL  PARAMETER  DISCRETIZATION 
STRATEGY  FOR  MULTIPLE  MODEL 
ADAPTIVE  ESTIMATION  AND  CONTROL 


I.  Introduction 


I.l  Overview 

This  thesis  presents  an  area  of  research  in  the  design  of  adaptive  estimators 
and  controllers.  It  centers  around  multiple  model  adaptive  estimation  (MMAE), 
using  a  bank  of  Kalman  filters  for  estimating  the  states  and  parameters  of  a  system. 
MMAE  is  used  as  the  basis  for  multiple  model  adaptive  control  (MMAC)  systems 
which  provide  control  in  the  presence  of  parameter  variation.  Section  1.2  provides 
some  background  on  MMAE  and  MMAC. 

The  design  method  proposed  herein  involves  optimally  choosing  the  models 
upon  which  to  base  the  elemental  filters  of  MMAE  systems,  or  the  elemental  con¬ 
trollers  of  MMAC  systems,  with  parameters  that  can  vary  over  a  continuous  range. 
The  problem  is  choosing  a  finite  subset  of  representative  parameters  from  an  infinite 
parameter  set.  Section  1.3  describes  the  model  selection  problem  encountered  when 
using  MMAE  that  is  solved  through  this  research. 

Section  1.4  outlines  the  scope  of  this  research  effort  through  the  statement  of 
three  hypotheses  which  are  supported  in  the  following  chapters.  Section  1.5  outlines 
the  approach  taken  in  this  research  effort  and  provides  some  direction  for  developing 
similar  design  methods.  Section  1,6  presents  the  major  assumptions  which  facilitate 
this  method  and  outlines  the  approximations  made  in  the  theoretical  development. 
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1.2  Background 

MMAE  is  accomplished  through  the  use  of  a  multiple  model  filtering  algorithm 
(MMFA)  [1,  6,  33,  36].  The  MMFA  has  been  shown  to  perform  well  as  a  filter  for 
tracking  airborne  targets  [7,  12,  31,  32,  38,  41,  46,  49,  55,  59,  60,  61],  for  tracking 
submarines  [47],  for  estimating  parameters  in  space  structure  control  systems  [10, 
23,  24,  29,  30],  for  estimating  parameters  in  flight  control  systems  [3,  5,  34,  54], 
for  estimating  the  position  of  a  neutral  particle  beam  [15,  22,  42,  43,  44,  65],  for 
estimation  in  a  terrain  assisted  navigation  system  [56],  and  for  estimation  in  a  tank 
fire  control  system  [64].  The  AFIT  robotics  research  group  proposes  using  MMAE 
to  estimate  the  changing  parameters  in  a  robotic  manipulator  [57,  58].  This  research 
is  concerned  with  choosing  the  models  in  an  MMFA  to  minimize  the  mean  squared 
estimation  error. 

1.2.1  Multiple  Model  Adaptive  Estimation.  The  MMFA  is  an  algorithm  orig¬ 
inally  proposed  by  Magill  [33]  to  provide  accurate  state  estimation  under  parameter 
variation.  The  true  parameter  vector,  a*  ,  is  assumed  to  be  a  member  of  a  finite  set  of 
possible  vectors,  {a^  a^, . . . ,  a^}.  As  shown  in  Figure  1,  a  bank  of  Kalman  filters  is 
constructed  with  each  filter  based  upon  the  hypothesis  that  a*  =a*  ,  A:  =  1, 2, . . .  ,A'  . 
The  filters  run  in  parallel,  and  the  residuals  in  each  filter  are  monitored  by  an  exec¬ 
utive  routine.  This  routine  calculates  the  hypothesis  conditional  probability, 

=  prob{a*  =  a*  |  Z(<,)  =  Z,},  for  each  of  the  elemental  filters,  whereZ,  represents 
a  realization  of  the  measurement  history,Z(t,)  ,  which  itself  is  composed  of  partitions 
that  are  the  individual  measurements  at  each  sample  time,  z(ti),  z(<2),  •  •  • ,  z(t,). 

The  conditional  densities  are  used  to  update  this  probability  at  each  sample 
instant  by  the  calculation  [53j: 


P  (A)  — 


/z(t.)|a,Z(t,_i)(z.-  I  a*,Z._i)p*(t.-l) 


Tlj=i  /z{t,)|a,Z(t,_i)(2i’  1  Z,_i)p^(i,_i) 
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Figure  1.  Multiple  Model  Filtering  Algorithm 
where  the  conditional  density  for  the  current  measurement,  z(t,),  is  given  by: 

/z(M|a,zp._,)(z.  I  a^Z,_^)  =  I  |i/2 

(2) 

wherer*(t,)  is  the  residual  vector  and  is  the  anticipated  residual  covariance 

generated  in  the  Kalman  filter  as 


The  initial  probability  distribution  p*(to)  is  chosen  b^lsed  on  any  prior  information 
available.  For  example,  if  each  model  is  equally  likely,  the  a  priori  distribution  could 
be  set  to  p*(to)  =  1/K,  k  =  1,2,. ..,K. 

There  are  two  accepted  methods  to  calculate  state  and  parameter  estimates 
once  one  has  the  conditional  probability  distribution.  The  Bayesian  method  produces 
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a  state  estimate  which  is  the  conditional  mean  of  the  state  vector  x.  The  conditional 
mean  of  x  is 

x(*n  =  E*‘(C>p‘('0  (4) 

*=1 

where  x(tj*')  denotes  the  estimate  at  time  f,  after  the  mejisurement  update.  The 
term  x(^,~)  used  in  the  following  chapter  to  denote  the  estimate  at  time  t,  before 
the  measurement  update.  The  parameter  estimate,  the  conditional  mean  of  a,  can 
be  found,  if  desired,  by  calculating 

a(^)  =  EaV(t.)  (5) 

*:=1 

The  maximum  a  posteriori  (MAP)  method  chooses  the  state  and  parameter 
estimates  as  the  values  in  the  model  with  the  highest  probability: 

=  for  k  such  that  =  max{p^(t,)}  (6) 

3 

A  major  assumption  underlying  this  research  is  that  the  MMFA  converges  to 
the  propel  model.  Baxain  showed  in  his  dissertation  [4]  that  both  the  Bayesian  and 
MAP  methods  converge  in  steady  state  to  the  model  closest  to  the  actual  parameter 
value  with  respect  to  his  information  measure.  Therefore,  this  research  is  applicable 
to  either  of  these  methods  of  model  selection,  though  in  ^lny  particular  implementa¬ 
tion,  either  method  has  advcintages  and  disadvantages.  The  MAP  method  chooses 
the  model  which  is  most  probably  the  correct  filter.  However,  as  the  selected  fil¬ 
ter  changes,  the  state  estimate  has  jump  discontinuities.  The  Bayesian  method  has 
much  smoother  transitions  between  filters  as  the  par^lmeter  chcinges.  However,  the 
state  estimate  has  a  contribution  from  filters  with  a  low  probability  of  being  the 
correct  filter. 

In  this  research,  the  MMFA  is  restricted  to  have  steady-state  constant-gain 
Kalman  filters.  That  is,  each  filter  will  be  based  on  a  time  invariemt  model  with 
stationary  noises,  and  the  precalculated  steady  state  value  of  the  covairiance  matrix 
will  be  used  to  calculate  the  gain,  thus  ignoring  any  natural  initial  transient.  This 
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Figure  2.  Multiple  Model  Control  Algorithm 

also  implies  that  the  steady-state  value  of  Af*  is  used  in  the  conditional  density  of 
Equation  (2). 

1.2.2  Multiple  Model  Adaptive  Control.  MM  AC  is  accomplished  through  the 
use  of  a  multiple  model  control  algorithm  (MMCA)  [3,  13,  l  i,  27,  37].  The  MMCA 
is  an  extension  of  the  MMFA  in  which  each  elemental  filter  is  appended  with  a  gain 
to  form  the  elemental  controllers  as  shown  in  Figure  2.  These  state  feedback  control 
laws  are  calculated  for  each  model  by  any  suitable  method,  such  as  LQG  synthesis  or 
eigenstructure  assignment.  The  control  to  the  plant  is  then  calculated  as  an  average 
of  the  control  signals,  weighted  by  the  conditional  probability  as  follows: 

u-  (i,)  i  y;  u‘  ((,)p‘{(0  (7) 

A;=l 

Aside  from  the  publications  deriving  and  explaining  the  multiple  model 
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formulations  [1,  6,  8,  20,  26,  27,  33,  45],  research  ha^  been  done  on  convergence 
of  the  estimator  [4,  17,  18],  setting  bounds  on  the  error  of  the  estimator  [17,  18], 
analysis  of  the  oscillation  phenomena  in  an  MMCA  [13,  14],  estimation  of  dis¬ 
tributed  systems  [2l],  decentralized  estimation  and  control  [16],  moving  bank  struc¬ 
tures  [19,  23,  24,  29,  30,  39],  and  methods  for  choosing  the  representative  models 
[24,  28,  30,  34,  39,  52]. 

1.3  Problem  Statement 

The  theoretical  development  of  the  MMFA  assumes  the  true  parameter  vector 


is  a  member  of  a  finite  set  of  possible  vectors,  {a*, a^, . . . ,  a^).  In  most  physical 
systems,  each  parameter  varies  over  a  continuous  bounded  range 

a,'  €  .Ai  =  [a,-,  a^  ]  for  i  =  1, 2, . . . ,  r 

which  is  called  the  admissible  set  of  parameter  values.  This  means  there  is  an 
infinite  number  of  possible  values  for  each  parameter.  It  is  impossible  to  design  a 
bank  containing  an  infinite  number  of  filters,  so  a  finite  subset  of  parameters  must 
be  chosen  from  this  set.  The  models  in  the  MMFA  or  MMCA  are  based  on  this  finite 
subset  of  parameter  values. 

To  madce  MMAE  and  MMAC  effective  and  attractive  to  the  practicing  engineer, 
there  needs  to  be  a  practical  method  for  choosing  the  finite  subset  which  ensures 
that  it  represents  the  admissible  set  of  parameter  values.  A  subset  of  parameters 
is  considered  a  representative  subset  if  an  MMFA  or  MMCA  based  on  this  subset 
performs  adequately  when  the  true  parameter  vector  is  anywhere  in  the  admissible 
set.  The  question  “What  is  adequate?”,  leads  to  the  definition  of  the  problem: 
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Problem:  Previously  accepted  methods  of  choosing  representative  pa¬ 
rameters  from  the  admissible  set  of  parameter  values,  for  an 
MMFA  or  MMCA,  are  ad  hoc  and  do  not  provide  a  means 
for  measuring  the  quality  of  the  designer’s  choice  of  models. 


1.  Sengbush  &  Lainiotis  proposed  in  [52]  two  algorithms  for  binary 
quantization  of  the  parameter  space. 

2.  Lamb  &  VVestphal  used  a  simplex  method  of  nonlinear  program¬ 
ming  in  [28]  to  direct  the  parameter  selection. 

3.  Matthes  used  performance  boundaries  from  simulations  to  define 
the  subset  covered  by  each  of  his  models,  then  chose  enough  models 
to  cover  the  entire  set  of  interest  in  [34j. 

In  applications  research  at  .AFIT,  several  methods  of  model  selection  have  been 
implemented.  One  of  the  simplest  methods  is  to  divide  the  parameter  set  linearly 
so  the  models  are  an  equal  distance  apart  in  the  parameter  space  [19,  23,  24,  39]. 
Equal  spacing  on  a  logarithmic  scale  has  been  used  for  estimating  a  break  frequency 
parameter  [19,  39].  When  this  method  did  not  provide  good  results.  Lashlee  [29,  30] 
defined  a  nominal  model  and  chose  neighboring  models  based  on  the  amount  of 
performance  degradation  experienced  when  the  true  parameter  value  w’as  that  of 
a  neighboring  model.  For  tracking  problems,  models  were  chosen  based  on  the 
dynamics  expected  from  the  target  [41,  55,  60,  61]. 

These  results  and  the  work  done  by  Matthes  [34]  point  toward  using  the  per¬ 
formance  of  the  estimator  (and/or  controller)  as  the  best  choice  of  model  selection 
criterion.  Therefore,  this  research  is  aimed  toward  developing  a  method,  based  on 
estimation  (control)  performance,  for  optimizing  the  model  selection  for  a  MMF.4 
(MMCA)  to  provide  the  best  estimation  (control)  with  the  least  number  of  models. 
The  primary  goal  of  this  research  has  been  the  development  of  an  optimization  pro¬ 
cedure  for  choosing  the  discretization  of  a  parameter  space.  This  entails  defining  a 
measure  of  quality  for  a  particular  choice  (i.e.,  a  cost  functional  to  minimize)  and 
choosing  a  method  to  accomplish  the  minimization.  There  are  two  approaches  that 
would  be  beneficial  to  a  design  engineer: 
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1.  When  faced  with  limitations  on  the  amount  of  computing  power  available,  it 
would  be  desirable  to  choose  the  parameter  discretization  which  would  pro¬ 
vide  the  lowest  estimation  or  tracking  error  based  on  a  prespecified  number  of 
models. 

2.  When  faced  with  constraints  on  performance,  it  would  be  desirable  to  choose 
the  discretization  based  on  the  amount  of  estimation  or  tracking  error  that 
can  be  tolerated.  The  idea  would  be  to  choose  the  fewest  number  of  model 
parameters  from,  the  parameter  set  to  achieve  at  least  the  desired  performance. 


The  ne.xt  section  sets  forth  limits  on  the  scope  of  this  research  so  that  the 
primary  goal  is  achieved  through  focused  effort. 


/../  Scope 

d  he  scope  of  this  effort  is  the  development  of  relationships  and  methods  to 
support  the  following  hypotheses: 


Hypothesis  ia  There  exists  a  representative  subset  of  the  admissible  param¬ 
eter  set,  which  forms  the  basis  of  an  MMFA,  of  prespecified 
size,  to  provide  state  estimation  with  minimum  estimation 
error. 


Hypothesis  lb  There  is  a  minimum  number  of  parameters  in  any  repre¬ 
sentative  subset  of  the  admissible  parameter  set.  forming 
the  basis  of  an  MMFA.  that  provide  state  estimation  with  a 
prespecified  bound  on  the  estimation  error. 

Hypothesis  2a  There  exists  a  representative  subset  of  the  admissible  param¬ 
eter  set,  which  forms  the  basis  of  an  MMF.A.  of  prespecified 
size,  to  provide  parameter  estimation  with  minimum  esti¬ 
mation  error. 


Hypothesis  2b  There  is  a  minimum  nuw.ber  of  parameters  in  any  repre¬ 
sentative  subset  of  the  admissible  parameter  set,  forming 
the  basis  of  an  MMF.A,  that  provide  parameter  estimation 
with  a  prespecified  bound  on  the  estimation  error. 

Hypothesis  3a  There  exists  a  representative  subset  of  the  admissible  param¬ 
eter  set,  which  forms  the  basis  of  an  MMC.A,  of  prespecified 
size,  to  provide  control  with  minimum  tracking  error. 

Hypothesis  3b  There  is  a  minimum  number  of  parameters  in  any  represen¬ 
tative  subset  of  the  admissible  parameter  set,  forming  the 
basis  of  an  MMCA,  that  provides  control  with  a  prespecified 
hound  on  the  tracking  error. 
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These  hypotheses  all  deal  with  finding  a  system  which  provides  some  kind  of 
optimal  performance.  To  say  that  one  system  is  optimal  makes  it  necessary  to  define 
some  measure  of  quality  by  which  to  judge  the  performance  of  any  candidate  system. 
This  leads  to  the  requirement  for  a  functional,  J,  to  be  minimized,  which  represents 
system  performance. 

Some  designers  might  choose  to  minimize  the  maximum  amount  of  error  one 
would  expect  to  see  if  the  true  parameter  vector,  a*,  is  anywhere  within  the  adnnssi- 
ble  subset,  A,  of  the  parameter  space.  One  suitable  measure  is  the  maximum  mean 
squared  error; 

=  maxE{[x-x]‘''W[x-x]} 

=  max  E{tr[x^Wx]} 

=  m^^  E{tr[W)cx^]} 

=  m^^  tr[WE{xx'’^}] 

=  m^tr[W^| 

vvhereE{»}  is  the  expectation  operator  and’®'  is  the  autocorrelation  matrix  of  the 
estimation  error  vector,  x. 

In  this  caseW  is  a  matrix  of  weighting  coefficients.  If  a  vector  of  linear 
combinations  of  the  error,  y  =  C[x  —  x],  is  of  interest,  this  functional  can  he  used 
with  W  =  C^W'C.  The  example  in  Chapter  III  illustrates  the  use  of  W  to  alter 
the  optimal  discretization  when  the  states  are  of  varying  interest. 

It  is  ecisy  to  see  where  problems  may  occur  using  this  functional  in  a  numerical 
optimization  problem.  If  the  maximum  error  occurred  between  the  and  filter, 
additional  filters  could  be  added  anywhere  but  between  these  filters  without  chang;ing 
the  value  of  the  functional.  The  functional  chosen  for  this  research  program  is  the 
average  value  of  the  mean  squared  estimation  error,  where  the  average  is  taken  as 
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the  true  parameter  ranges  over  the  entire  admissible  parameter  set: 


where 


2  A  J^E{[x-xl'^W[x-x]}da 


/  da  =  /  •••  /  /  daida2...dar 

JA  JAr  JAi  JAi 


(9) 


This  functional  was  chosen  because  it  provides  information  about  both  the 
mean  and  variance  of  the  estimation  error  across  the  range  of  admissible  parameter 
vectors.  There  are  any  number  of  other  cost  functionals  which  one  might  want  to 
minimize.  One  such  cost  functional  is  a  linear  combination  of  Equations  (8)  and 
(9)  to  force  good  overall  performance  combined  with  a  limit  on  the  maximum  error 
autocorrelation.  This  is  investigated  in  Section  3. 1.5. 3  for  the  example  with  a  cost 
functional  of  the  form: 

J=  AP  +  (1-A)J~  (10) 


Besides  performance  based  on  estimation  error,  one  might  consider  perfor¬ 
mance  of  the  estimator  based  on  convergence  to  the  proper  model.  One  might 
want  to  choose  the  representative  parameter  set  to  enhance  convergence  properties. 
Measures  such  as  the  curvature  of  an  ambiguity  function  [36]  might  be  maximized  to 
enhance  identifiability.  There  may  be  some  way  to  use  Baram’s  information  measure 
[4]  to  formulate  a  functional  which  can  be  used  to  enhance  identifiability.  Functionals 
such  as  these  are  not  addressed  in  this  dissertation. 

In  some  cases,  accurate  estimation  of  the  states  may  not  be  as  important  as 
accurate  estimation  of  the  parameter.  Figure  3  illustrates  one  way  a  parameter 
estimator  might  be  used.  The  functional  chosen  to  investigate  parameter  estimation 

,.AX,E([a-a]'^W[a-a]}da 

/^da  '  ' 

It  is  shown  in  Chapter  III  that  minimizing  this  functional  does  not  necessarily  pro¬ 
duce  the  same  result  as  minimizing  the  state  estimation  functional  in  Equation  (9). 
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Figure  3.  Parameter  Estimation  in  a  Control  System 


Figure  4.  A  Multiple  Model  Adaptive  Regulator 


11 


When  a  controller  comes  into  play,  it  is  desirable  to  minimize  the  tracking  error. 
Figure  4  is  a  block  diagram  of  an  MMCA  used  as  a  regulator  where  the  objective* 
is  to  control  the  output  vector,  y(<,),  to  zero.  The  cost  functional  for  this  system, 
analogous  to  Equation  (9),  can  be  written  as  follows: 

,,  A  /^E(yTWy)d» 

As  is  the  case  with  the  state  estimation  problem,  there  are  any  number  of  other 
parameter  estimator  or  controller  cost  functionals  which  one  may  want  to  minimize, 
and  this  research  does  not  preclude  using  any  of  them  for  a  similar  design  effort. 
The  development  shown  in  the  sequel  may  be  used  to  construct  analogous  design 
methods. 

1.5  General  Approach 

This  section  discusses  in  detail  the  approach  taken  to  support  the  hypotheses 
presented  in  Section  1.4.  Included  is  a  graphical  representation  of  the  steps  taken 
in  this  research  to  show  how  the  design  method  was  developed  and  evaluated.  This 
section  is  meant  to  provide  the  reader  with  a  roadmap  of  the  research  summarized 
in  this  dissertation,  and  to  provide  insight  about  how  other  similar  design  methods 
might  be  developed. 

Hypothesis  la  There  exists  a  representative  subset  of  the  admissible  param¬ 
eter  set,  which  forms  the  basis  of  an  MMFA,  of  prespecified 
size,  to  provide  state  estimation  with  minimum  estimation 
error. 

Approach  1.  Stwt  with  a  single  input  single  output  (SISO)  linear 
plant  with  one  parameter  (r  =  1)  driven  by  white  Gaussian 
noise  with  known  values  of  dynamic  driving  noise  strength 
(Q)  and  measurement  noise  covariance  (R). 

2.  Assume  the  filters  are  constant-gain  steady-state  Kal¬ 
man  filters  calculated  with  the  values  of  Q  and  R  from  the 
truth  model. 

3.  Assume  the  number  of  models  is  specified. 
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4.  Assume  the  cost  criterion  is  based  on  steady  state  error 
with  the  estimator  converging  to  the  model  closest,  in  the 
information  measure  of  Baram  [4],  to  the  true  parameter 
value  (i.e.  assume  there  is  enough  information  in  the  mea¬ 
surements  to  force  the  estimator  to  converge  to  the  selected 
filter  with  probability  1). 

5.  Derive  a  relationship  between  the  estimation  error  of  a 
model  and  the  distance  of  that  model’s  assumed  parameter 
value  from  the  true  parameter  value. 

6.  Develop  a  procedure  for  numerically  minimizing  the  cost 
functional. 

7.  Evaluate  the  basic  optimization  algorithm  on  a  single 
input  single  output  (SISO)  example  problem  with  one  pa¬ 
rameter  and  no  weighting  matrix. 

8.  Generalize  the  procedure  to  account  for  a  vector  of  r  pa¬ 
rameters,  and  to  account  for  multiple  input  multiple  output 
(MIMO)  systems. 

9.  Evaluate  the  effect  of  different  weighting  matrices  on 
the  example  problem  to  exhibit  the  benefits  of  this  design 
method. 

10.  Evaluate  the  effect  of  placing  a  lower  bound  on  the 
probability  to  provide  for  an  estimator  which  can  adapt  to 
varying  parameters. 

11.  Investigate  other  types  of  estimator  mechanizations  in¬ 
cluding  a  maximum  entropy  with  unit  covariance  (ME/I) 
hypothesis  [25]  in  the  hypothesis  conditional  probability  cal¬ 
culation.  This  mechanization  forces  convergence  to  the  filter 
with  the  lowest  residual  autocorrelation,  E{r*^(f,)r*(f,)}, 
instead  of  the  lowest  E{r^^(f,)[Af per  Equa¬ 
tion  (2). 

12.  Investigate  alternative  cost  functionals  including  a  max¬ 
imum-error-type  cost  as  shown  in  Equation  (10). 

Contribution  An  automated  method  for  selecting  the  models  in  an  MMFA. 

Notes  A  graphical  representation  of  this  approach  is  shown  in  Fig¬ 
ure  5  along  with  the  interrelation  between  this  hypothesis 
and  the  other  hypotheses. 
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Hypothesis  lb 


Approach 


Contribution 

Notes 


Hypothesis  2a 


Approach 


Contribution 

Notes 


There  is  a  minimum  number  of  parameters  in  any  repre¬ 
sentative  subset  of  the  admissible  parameter  set,  forming 
the  basis  of  an  MMFA,  that  provide  state  estimation  with  a 
prespecified  bound  on  the  estimation  error. 

1.  Use  the  minimization  procedure  developed  above  to  de¬ 
fine  a  mapping  from  the  number  of  models  to  the  expected 
cost. 

2.  Choose  the  minimum  number  of  models  from  this  list 
which  provides  at  least  the  level  of  performance  required. 

A  method  for  determining  the  number  of  models  required 
for  a  specified  allowable  error. 

Requires  accomplishing  the  tasks  in  Hypothesis  la. 


There  exists  a  representative  subset  of  the  admissible  param¬ 
eter  set,  which  forms  the  basis  of  an  MMFA,  of  prespecified 
size,  to  provide  parameter  estimation  with  minimum  esti¬ 
mation  error. 

1.  Starting  with  the  basic  system  and  algorithm  developed 
for  the  state  estimator,  restructure  the  algorithm  to  mini¬ 
mize  the  parameter  estimation  error. 

2.  Evaluate  the  parameter  estimation  algorithm  using  the 
example  problem. 

3.  Evaluate  the  effect  of  different  weighting  matrices  on 
the  example  problem  to  exhibit  the  benefits  of  this  design 
method. 

4.  Evaluate  the  effect  of  placing  a  lower  bound  on  the  proba¬ 
bility  to  provide  for  an  estimator  which  can  adapt  to  varying 
parameters. 

5.  Investigate  other  types  of  estimator  mechanizations  in¬ 
cluding  a  maximum  entropy  with  unit  covariance  (ME/I) 
hypothesis  [25]. 

An  automated  method  for  selecting  the  models  in  a  param¬ 
eter  estimating  MMFA. 

A  graphical  representation  of  this  approach  is  shown  in  Fig¬ 
ure  6. 
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Figure  6.  Graphical  Representation  of  Hypothesis  2a 


Hypothesis  2b  There  is  a  minimum  number  of  parameters  in  any  repre¬ 
sentative  subset  of  the  admissible  parameter  set,  forming 
the  basis  of  an  MMFA,  that  provide  parameter  estimation 
with  a  prespecified  bound  on  the  estimation  error. 

Approach  1.  Use  the  minimization  procedure  developed  above  to  de¬ 
fine  a  mapping  from  the  number  of  models  to  the  expected 
cost. 


2.  Choose  the  minimum  number  of  models  from  this  list 
which  provides  at  least  the  level  of  parameter  estimation 
accuracy  required. 

Contribution  A  method  for  finding  the  number  of  models  required  for  a 
specified  allowable  parameter  estimation  error. 

Notes  Requires  accomplishing  the  tasks  in  Hypothesis  2a. 


Hypothesis  3a  There  exists  a  representative  subset  of  the  admissible  param¬ 
eter  set,  which  forms  the  basis  of  an  MMCA,  of  prespecified 
size,  to  provide  control  with  minimum  tracking  error. 
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Approach 


Contribution 

Notes 

Hypothesis  3b 

Approach 

Contribution 

Notes 


1.  Starting  with  the  basic  system  and  algorithm  developed 
for  the  state  estimator,  restructure  the  algorithm  to  mini¬ 
mize  the  truth  state  autocorrelation.  Define  the  method  of 
controller  design,  specifying  all  necessary  design  variables. 
The  method  of  choice  is  LQG  constant  gain  controllers,  be¬ 
cause  the  control  gain  can  be  automatically  calculated  for 
any  given  parameter  based  upon  designer  specified  weight¬ 
ing  matrices.  Only  constant  gain  filters  and  controllers  are 
addressed  in  the  sequel. 

2.  Evaluate  the  controller  optimization  algorithm  using  the 
example  problem. 

3.  Evaluate  the  effect  of  different  weighting  matrices  on 
the  example  problem  to  exhibit  the  benefits  of  this  design 
method. 

4.  Evaluate  the  effect  of  placing  a  lower  bound  on  the  proba¬ 
bility  to  provide  for  an  estimator  which  can  adapt  to  varying 
parameters. 

5.  Investigate  other  types  of  estimator  mechanizations  in¬ 
cluding  a  maximum  entropy  with  unit  covariance  (ME/I) 
hypothesis  [25]. 

An  automated  method  for  selecting  the  models  in  an  MMCA. 

A  graphical  representation  of  this  approach  is  shown  in  Fig¬ 
ure  7. 


There  is  a  minimum  number  of  parameters  in  any  represen¬ 
tative  subset  of  the  admissible  parameter  set,  forming  the 
basis  of  an  MMCA,  that  provides  control  with  a  prespecified 
bound  on  the  tracking  error. 

1.  Use  the  minimization  procedure  developed  above  to  de¬ 
fine  a  mapping  from  the  number  of  models  to  the  expected 
cost. 

2.  Choose  the  minimum  number  of  models  from  this  list 
which  provides  at  least  the  level  of  performance  required. 

A  method  for  determining  the  number  of  models  required 
for  a  specified  allowable  error. 

Requires  accomplishing  the  tasks  in  Hypothesis  2a. 
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Figure  7.  Graphical  Representation  of  Hypothesis  3a 
1.6  Assumptions 

Some  assumptions  aje  necessary  to  bound  the  problem  and  form  the  basis  of  the 
mathematical  development  of  Chapter  II.  The  main  Jissumption  is  that  the  structure 
of  the  plant  is  known  except  for  an  r-vector  of  parameters,  which  is  aissumed  to  be 
a  member  of  an  infinite  set.  It  is  assumed  that  this  is  a  bounded  and  connected 
set  so  that  it  makes  sense  to  numerically  integrate  over  the  set  in  order  to  find  the 
average  mean  square  error.  This  is  not  really  necessary,  but  it  gives  the  problem  more 
physical  meaning.  It  is  cissumed  that  there  are  a  finite  number  of  filters  available  for 
the  estimation,  or  a  finite  number  of  controllers  available  for  the  control.  The  reason 
is  obvious;  however,  it  is  also  assumed  that  the  designer  cein  specify  this  number. 

The  pseudo-probabilities  will  be  calculated  per  Equation  (1).  The  probabilities 
are  referred  to  as  pseudo-probabilities  because  it  is  impossible  to  have  an  infinite 
bank  of  filters,  hence,  the  MMFA  might  be  calculating  the  probability  that  each 
filter  is  the  correct  one,  when  in  fzw:t  none  of  them  axe.  There  aje  other  methods  of 
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calculating  a  pseudo-probability  [25],  one  of  which  is  discussed  in  Section  2. 1.3.1. 

It  is  assumed  throughout  that  the  Bayesian  rule,  given  by  Equation  (4),  is  used 
to  calculate  the  estimate.  One  l«ist  assumption  is  that  the  system  converges  to  the 
model  closest,  in  the  information  measure  of  Baram  [4],  to  the  true  parameter  value 
with  probability  1.  This  inherently  assumes  the  application  of  persistent  excitation, 
which  is  a  concept  well  known  in  the  field  of  adaptive  control  and  discussed  in  [48, 
Chapter  6].  Persistent  excitation  has  not  been  explored  in  this  research. 

1.1  Summary 

Having  been  shown  to  perform  well  in  a  number  of  studies  (see  Section  1.2), 
the  MMFA  and  MMCA  have  been  chosen  as  the  basis  of  this  research  into  adaptive 
estimation  and  control.  This  chapter  provides  some  background  material  on  the 
structures  of  these  multiple  model  algorithms. 

The  MMFA  (and  MMCA)  design  problem  has  been  defined  as  one  of  choosing 
a  representative  parameter  set,  for  the  basis  of  elemental  filter  (controller)  design, 
that  is  optimized  with  regard  to  some  measure  of  the  performance  of  the  resulting 
system.  The  performance  measure  chosen  for  this  research  is  the  mean  square  error 
averaged  over  the  admissible  parameter  set.  The  remaining  chapters  follow  the 
outlines  presented  in  Section  1.5  to  provide  am  organized  path  toward  solving  the 
MMFA  (and  MMCA)  design  problem. 
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II.  Development 


The  development  of  the  optimization  method  is  presented  in  this  chapter.  The 
equations  describing  the  variables  of  interest  for  state  variable  estimation,  parameter 
estimation,  and  state  variable  control  are  derived,  based  on  the  premise  that  a  finite 
number  of  filters  will  be  used. 

2.1  Optimizing  a  Multiple  Model  State  Estimator 

As  explained  in  the  previous  chapter,  it  is  desirable  to  minimize  the  cost  func¬ 
tional 


2  A  /^E{[x-x]'''W[x-x]}da 

where 

/  da  =  /  •  •  •  /  /  daida2 . . .  da,. 


(13) 


Since  it  is  cissumed  that  the  admissible  parameter  set  is  constant  for  a  given 
problem,  it  is  necessary  only  to  minimize  the  numerator  of  Equation  (13).  Moreover, 


E{[x  -  x]^W[x  -  x]}  =  E{tr  [[x-x]'''W[x-x]]}  (14) 

=  E{tr  [W[x  -  x][x  -  x]"*^] }  (15) 

=  tr(W^]  (16) 

Thus,  one  needs  to  compute  the  error  autocorrelation  matrix,  ^  =  E{[x— x][x— x]^}, 
as  a  function  of  the  parameter  vector  a.  This  is,  in  essence,  a  sensitivity  analysis  of 
the  effect  of  parameter  variation  on  each  of  the  filters  in  the  MMFA. 

Figure  8  illustrates  the  concept  for  a  single  filter  system  based  on  the  scalar 
parameter  a^.  The  plot  shows  the  value  of  tr[W^]  as  a  function  of  the  true  parameter 
a*.  The  curve  represents  how  the  error  autocorrelation  might  increase  as  the  true 
parameter  differs  from  the  parameter  used  to  design  the  filter.  Along  with  knowing 
the  error  autocorrelation  for  a  particular  filter,  it  is  necessary  to  develop  the  equations 


trW^ 


True  Parameter  (a*) 

Figure  8.  Sensitivity  Analysis  for  a  Scalar  Parameter 

which  describe  how  the  estimator  converges  to  a  particular  filter  at  a  particular  point 
in  the  parameter  space.  A  numerical  integration  technique  can  then  be  employed  to 
evaluate  the  functional  for  a  given  choice  of  filters. 

S.1.1  Derivation  of  the  Estimator  Equations.  The  equations  required  to  com¬ 
pute  the  error  autocorrelation  at  each  point  in  parameter  space,  and  the  filter  se¬ 
lection  equations  describing  how  the  estimator  converges  to  a  particular  filter,  are 
presented  in  following  sections. 

2.1.1. 1  Error  Autocorrelation  Equations.  We  want  to  find  out  what  the 
true  estimation  error  autocorrelation  is  when  we  use  a  model  based  on  a*': 

x‘(^+,)  =  +  G‘((i)w;((,)  (17) 

z‘((,)  =  H*((,)x‘((,)  +  v*(i,)  (18) 

which  varies  from  the  actual  system  with  the  truth  mode'  which  is  ba.sed  on  the  true 

parameter  a*: 

ti)x-{U)  +  (19) 

=  +  (20) 

and  where 

x^(<,)  is  the  n*  dimensional  state  vector  in  the  estimator  filter  models. 
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x*(i,)  is  the  n*  dimensional  state  vector  in  the  truth  model;  n’  need 
not  be  the  same  as  n*. 

ti)  is  the  n*  x  n*  state  transition  matrix  for  the  filter  model. 

ti)  is  the  n*  x  n*  state  transition  matrix  in  the  truth  system  model. 

Wj(ti)  is  an  s*-vector-valued  discrete-time  .vhite  Gaussian  noise  pro¬ 
cess  with  zero  mean  and  covariance  kernel 

E([wj((oi[w5(ii)r)  =  I 

0  t,  ^  t, 

is  an  s*-vector-valued  discrete-time  white  Gaussian  noise  pro¬ 
cess  with  zero  mean  and  covariance  kernel 

0  ti  ^  tj 

is  the  n*'  x  s*  noise  input  matrix  for  the  k*^  filter  model. 

Gj{ti)  is  the  n*  x  s*  noise  input  matrix  for  the  truth  model. 

z’’  is  the  m  dimensional  random  measurement  vector  as  modelled 
in  the  filters. 


z*  is  the  m  dimensional  random  vector  which  represents  the  “real 
world”  measurement  from  the  truth  model. 

H'’{ti)  is  the  m  X  n'’  measurement  matrix  as  modelled  in  the  filters. 

H*{ti)  is  the  m  X  n"  measurement  matrix  in  the  truth  system  model. 

is  an  m-vector- valued  discrete-time  white  Gaussian  noise  pro¬ 
cess  with  zero  mean  and  covariance  kernel 

E{[v‘((i)i[v‘{(,)r}  = 
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ti  =  t, 
0  ti  ^  tj 


Wj  and  V*  are  assumed  to  be  independent. 


v*(<,)  is  an  m-vector- valued  discrete-time  white  Gaussian  noise  pro¬ 
cess  with  zero  mean  and  covariance  kernel 

E(|V(l,)|[V((,)lT}  = 

wj  and  V*  are  assumed  to  be  independent. 

The  development  uses  steady  state  arguments  so  the  following  assumptions  are 
made  about  time  invariance; 


R'((,)  (.  =  tj 
0 


<1+1  —  <,  =  a  constant  Vi 

Vi 

=  Vt 
Vi 

/f*(i.)  =  H'  Vi 
G*(i.)  =  G*  Vi 
G*(i.)  =  G*  Vi 

The  filters  are  assumed  to  be  constant-gain  filters  (i.e.,  K”*  does  not  vary  with 
time).  For  the  example  shown  in  the  sequel,  the  steady  state  Kalman  filter  gain  is 
chosen.  It  is  important  to  bring  out  the  point  that  the  filters  in  the  MMFA  need 
not  be  steady  state  Kalman  filters.  For  this  development  to  apply,  it  is  necessary 
that  the  filters  have  the  same  structure  cis  a  Kalman  filter,  but  any  other  method 
of  calculating  a  constant  gain  that  produces  a  stable  estimator  (e.g.,  eigenstructure 
assignment)  may  be  used.  However,  to  use  this  method  effectively  for  minimizing 
the  error  autocorrelation,  an  automated  method  which  can  determine  AT*  for  any 
given  parameter  is  suggested.  Methods  which  require  hand  tuning  to  determine  the 
gain  are  not  recommended. 
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The  steady  state  Kalman  filter  gain  is  used  in  this  dissertation,  because  the 
true  Kalman  filter  is  the  minimum  mean  squared  error  estimator  for  a  known  system 
driven  by  zero  mean  white  Gaussian  noise.  Since  the  goal  of  this  research  is  to 
minimize  the  average  mear  square  error,  it  is  natural  to  use  the  steady  state  Kalman 
filter  gain. 

The  filter  selection  within  the  MMFA  is  determined  using  the  one-step  predic¬ 
tion  model  [4,  page  78]  as  is  shown  in  the  next  section.  For  this  reason,  the  one-step 
prediction  model  is  derived.  If  x*(f“ )  is  defined  as  the  state  estimate  in  the  filter 
at  time  <,  before  the  measurement  update,  and  as  the  state  estimate  in  the 

filter  at  time  f,  after  the  measurement  update,  we  develop  a  one-step  discrete 
model  of  the  system  at  t~ .  Note  that  the  true  state  vector  does  not  change  due  to 
activity  in  the  estimator  so  x*(f“)  =  x'’(f/')  =  x*(<,). 

Derive  the  one-step  prediction  model  from  the  standard  Kalman  filter  equations 
[35,  page  275]  as  follows: 


x''(<+)  =  x''(<-)  +  K'^{z-{U)  -  (21) 


(22) 


x''(f .;J  =  +  K'’[H-xV7)  +  v*(<.)  -  H*x‘(<r)]}  (23) 

and  since  we  have  no  measurement  update  cycle  on  the  true  system 

=  (fx-(i-)  +  (24) 

At  this  point  we  introduce  the  shorthand  notation  •“  =  '{t~)  and  can  write 
the  one  step  prediction  model  as  in  [4,  page  78]. 


0 


xf- 

v;  + 

0 

+ 

.  . 

0 

(25) 
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The  prediction  error  equation  is  now  of  the  form: 


<1 

• 

<x 

wV 

=  r 

t 

+  Go 

dt 

.  . 

(26) 


where 


r  = 


0 


G.  = 


0 


G-  0 


and 


w 


di 


is  an  s*  +  m-vector  valued  discrete  time  white  Gaussian  noise  process 


with  zero  mean  and  covariance  kernel 


E{ 


^di 


V.' 


K^vf]}  =  I 


Qo{ti)  u  =  t, 
0  <,  7^  t, 


with 


QoiU)  = 


Q:iti)  0 

0  R‘{t,) 

Since  wj  and  v*  are  assumed  to  be  independent  random  vectors,  they  are  uncorre¬ 
lated. 


This  stucture  allows  us  to  calculate  the  autocorrelation  of 
matrix  equation 


x^- 


J 


A 


L  J 


Afc-T  v-*~f 

■*■.+1  -^i+l 


=  =  rH‘rT  +  g.QoGJ 


using  the 


(27) 


If  T  is  a  contraction,  as  i  — >  oo,  S*  approaches  a  constant  value  which  is  the 
steady  state  prediction  error  autocorrelation,  and  is  denoted  S*  . 


However,  we  are  not  interested  in 


A 

X . 


1+1 


but  in  the  estimation  error.  Since 


the  filter  might  implement  a  reduced  order  model,  we  must  introduce  a  trajisfor- 
mation  matrix  T  which  transforms  the  true  state  vector  into  the  filter  state  space 
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by: 

^implemented  ~  ^^true 

so  let  x*=  X*  —  Tx*  denote  the  estimation  error  vector. 

We  want  to  find  the  autocorrelation  matrix: 

E{[ii‘-  -  rx'-|ix‘-  -  rx-'i'^}  =  e{1x‘-||x‘-|T} 

Hence,  we  want  to  derive 


(28) 


(29) 


1  . 

1 

'x*-  ‘ 

wr 

«+l 

x*~ 

L  '+1  J 

=  c 

• 

di 

.  . 

(30) 


to  be  able  to  calculate 


[ 

x*'- 

- 

E- 

1 

1 

«+i 

i 

*- 

L  ‘+1  J 

i-fc— T  •v*'~T 


=  c* .  =  c.  r‘‘C^  +  g:q.g7 


(31) 


We  begin  by  writing  the  equation  for  x*”^: 


Ak-  _  -v-^”  Tv*~ 

■^i+i  ~  -^i+i  *  -^i+i 


-  rG;w;. 

=  -  Ttf  *x*-  + 


Now  define 


and 


-  Tcy:, 


TA^  =  -  T^‘ 


AH  =  H’^T  -  if* 


Rearranging  these  equations  gives 

T^*  =  -  TA^ 

if*  =  if*r  -  AH 


(32) 

(33) 

(34) 

(35) 

(36) 

(37) 

(38) 
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Substitute  Equations  (37)  and  (38)  into  Equation  (34)  to  get 


-  «P*'Tx*-  +  TA^x;-  +  -  TG>*.  (39) 

Rearrange  and  collect  terms  to  obtain 

xf;^  =  -  ^^Tx-  -  +  ^'‘K^H^Tx'-  +  TA^x'r 

-  ^'^K'^AHx;-  +  -  TG>*,  (40) 

xf-,  =  ^"'(xf-  -  Tx*-)  -  ^“K'^H'^ix’;-  -  Tx*-)  +  TA^x.- 

-  ^‘‘K'^AHx'l-  +  -  rG>*.  (41) 

=  <p*xf-  -  <?*=  JC*H*=xf-  +  TA^x]-  -  ^^K^AHx'r 

+  0'^K^v:  -  (42) 


xf;^  =  (<f^  -  4i''i<:*ff‘)xf- +  (rji<f /ii3-)x*- 
+  -  TG^w*. 

and  as  in  the  previous  development 


We  can  now  write  the  one-step  prediction  error 


'x^'r  ' 

t-i-i 

_ 

x*r 

L  »+i  J 

L 

^*‘{1  -  TA^-^'^K'^AH 

0 


xf- 


+ 


-TG- 

’  w;  ■ 

G*  0 

V* 

L  ' 

(43) 


(44) 


(45) 
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which  is  of  the  form 


x^T 

xf- 

w* 

1+1 

=  jC 

at 

.  . 

V" 

t 

Thus,  we  can  calculate 


X^- 

t+1 

x*r 

L 

L  «+i  J 

-'t+i  -^.-1-1 


=  r* 

«+i 


=  cr^c^  +  g'QoG'J 


(46) 


(47) 


If  £  is  a  contraction,  as  i  oo,  Ff’  approaches  a  constant  value  which  is  the  steady 
state  prediction  error  autocorrelation,  and  is  denoted  F^. 

The  upper  left  partition  of  T'-*',  E{[x|‘“][x|'"]^},  is  ,  the  autocorrelation  of 
the  estimator  prediction  error  in  the  filter.  We  calculate  E{[xf“]^W[xf“]}  by: 


E{[x‘-rw[x‘-l} 


E{trl(xf-lTw|xM|) 

E{tr[W(x‘-)lx‘-|T]} 

tr[WE{lx*-l(xMT)l 

tr|W«‘-] 


(48) 


The  complete  cost  functional  can  be  computed  ais 


A 


/^tr[W^--^-]da 

/Ada 


(49) 


where  sel  denotes  the  filter  selected  at  any  particular  value  of  the  parameter  vector. 
We  assume  that  the  estimator  converges  to  one  particular  filter  so  that  p*'*  =  1,  and 
all  other  p*  are  zero.  The  equations  governing  which  filter  is  selected  are  developed 
in  the  next  section.  In  practice,  a  lower  bound  is  put  on  the  hypothesis  conditional 
probability  for  each  filter.  This  prevents  the  probability  of  any  filter  from  converg¬ 
ing  to  zero,  and  allows  the  estimator  to  adapt  to  a  varying  parameter.  In  this  Ccise, 
the  true  estimator  error  autocorrelation  for  a  Bayesian  MMFA  would  be  a  prob¬ 
abilistically  weighted  average  of  the  individual  error  autocorrelations  calculated  in 
Equation  (48).  This  is  explored  in  Sections  2. 1.3.3  and  3. 1.5. 4. 
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2.1. 1.2  Equations  Governing  Filter  Selection  Within  the  Estimator.  In 
his  PhD  dissertation  [4],  Baram  presents  the  conditions  for  convergence  within  the 
MMFA.  If  sufficient  conditions  exist  for  the  MMFA  to  converge,  it  will  converge 
to  the  filter  based  on  the  parameter  closest  to  the  true  parameter  with  respect  to 
Baram’s  information  metric.  Baram ’s  development  makes  it  necessary  to  define  some 
terms  before  stating  the  conditions  for  filter  selection. 

M*  is  the  steady  state  prediction  error  covariance  of  the  residuals  cal¬ 
culated  within  the  filter  by 

+  il*  (50) 

where  [P*]“  is  the  prediction  error  covariance  of  the  states  as 
calculated  in  the  filter. 

Since  the  models  for  the  filters  do  not  match  the  truth  model,  this  value  cal¬ 
culated  within  the  filter  is  not  the  true  prediction  error  covariance.  Assuming  the 
truth  model  of  Equations  (19)  and  (20)  is  accurate  and  the  filters  are  based  on 
Equations  (17)  and  (18), 

is  the  actual  steady  state  prediction  error  autocorrelation  in  the 
k^*'  filter  generated  by  [4,  Eqn.  5.9] 

iV*  =  [-H^  H'\^  ^  E:  (51) 

=  [-if*'  -AH]rl[-H'^  -f  H*  (52) 

=  [if*'  AH]rl[H*‘  AH]^  -f  R-  (53) 

where  is  the  steady  state  value  calculated  in  Equation  (27) 
and  is  the  steady  state  value  calculated  in  Equation  (47). 

is  defined  eis  the  proximity  of  the  A:***  filter  generated  by 
[4,  Eqn.  5.16] 


£*  =  log^  1  M*  I  +tr[[M*']-‘^*' 


(54) 


If  the  conditions  exist  for  the  MMFA  to  converge  [4,  Theorem  5.2],  it  will 
converge  to  the  filter  governed  by 

=  min  &  j  =  1,. . .  ,K  (55) 

2.1.2  Estimator  Design  Algorithm.  The  following  is  a  procedure  by  which  the 
cost  functional  of  Equation  (13)  can  be  numerically  approximated  and  minimize*^ 

1. )  Start  by  describing  the  system  in  terms  of  the  parameter  vector  a,  spec¬ 
ifying  the  structure  of  the  truth  system  and  the  filters  to  be  implemented  in  the 
estimator. 

2. )  Choose  K,  the  number  of  filters  to  be  implemented  in  the  estimator. 

3. )  Choose  a  representative  parameter  set  to  begin  the  minimization.  A  rea¬ 
sonable  choice  might  be  made  by  dividing  the  admissible  parameter  region  into  equal 
intervals  and  choosing  the  midpoint  of  each  interval,  though  each  problem  may  re¬ 
quire  a  different  starting  point.  Insight  as  to  what  represents  a  reasonable  choice 
can  be  gained  by  plotting  the  optimal  estimation  error  autocorrelation.  That  is,  plot 
Equation  (48)  for  a  coarse  discretization  of  the  parameter  space  assuming  a*  =  a’  at 
each  point.  The  larger  tr[W'®'“]  is,  the  closer  the  representative  parameters  should 
be. 

4. )  Choose  a  numerical  integration  technique  which  will  be  used  to  evaluate 
Equation  (13).  This  integration  provides  the  value  of  the  functional  for  any  specific 
choice  of  representative  parameter  set.  Equation  (49)  defines  the  curve  as  a  function 
of  the  parameter,  and  is  used  to  evaluate  the  curve  at  any  required  point  in  the 
interval.  The  information  required  to  set  up  Equations  (45)  and  (47)  is  chosen  based 
on  the  proper  filter  selection  per  Equation  (55). 

5. )  Step  4  generates  a  numerical  approximation  to  the  value  J(a)  for  a  pre¬ 
specified  choice  of  parameter  vector  a.  The  problem  is  now  a  vector  minimization 
problem.  Choose  a  numerical  vector  minimization  technique  and  proceed  to  mini¬ 
mize  J(a)  using  the  procedure  in  step  4  to  evaluate  the  functional  for  each  parameter 
vector. 
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Step  4  is  computationally  intensive  because  an  algebraic  Riccati  equation  is 
solved  at  each  point  evaluated  on  the  curve.  Step  5  is  also  computationally  intensive 
because  derivative  information  is  not  explicitly  available,  and  must  be  computed 
with  difference  approximations.  The  numerical  integration  and  minimization  tech¬ 
niques  must  be  chosen  with  this  in  mind.  The  extended  trapezoidal  rule  [51,  Eqn. 
(4.1.11)]  is  the  numerical  integration  technique  used  in  the  example  of  Chapter  III. 
This  technique  is  a  reasonable  choice  because  it  provides  sufficient  accuracy  for  the 
example  proolem  and  involves  minimal  computaiionai  loading.  An  empirical  com¬ 
parison  of  several  techniques  (including  the  forward  rectangular  rule,  and  alternative 
extended  Simpson’s  rule  [51])  on  the  example  problem  resulted  in  negligible  difference 
(.00012  average  relative  change)  in  the  value  of  the  cost  functional.  The  extended 
trapezoidal  rule  also  has  the  desirable  characteristic  of  corresponding  closely  to  the 
method  used  to  analyze  simulation  results,  where  the  mean  square  error  is  averaged 
over  the  number  of  simulations. 

The  vector  minimization  is  done  using  a  Broyden-Fletcher-Goldfarb-Shanno 
(BFGS)  variable  metric  method  for  unconstrained  minimization  using  one-sided  dif¬ 
ference  approximations  in  the  software  package  ADS  [62].  This  method  wcis  chosen 
because  it  is  efficient  and  supposedly  requires  fewer  evaluations  than  the  Davidon- 
Fletcher- Powell  algorithm  to  achieve  the  same  level  of  accuracy  [11].  In  practice,  (for 
this  research  and  in  [63,  page  97])  there  is  no  discernible  difference  between  these  two 
methods.  The  techniques  chosen  produced  good  results  for  the  example  of  Chapter 
III. 


2.1.3  Modifications  to  the  Basic  Estimator  Optimization  Algorithm.  Recall, 
from  Section  1.5  and  specifically  Figure  5,  that  there  are  some  modifications  to  the 
basic  design  algorithm  which  are  desirable  from  a  practical  standpoint.  The  algo¬ 
rithm  as  presented  is  general  enough  to  account  for  MIMO  systems  and  parameter 
vectors.  It  also  incorporates  a  weighting  matrix,  W,  to  allow  the  designer  to  per¬ 
form  tradeoffs  between  states.  The  effect  of  this  weighting  matrix  is  explored  in 
Section  3. 1.5.1.  The  following  sections  explore  the  other  extensions  of  the  basic 
problem  which  require  modifications  to  the  btisic  optimization  algorithm. 
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2. 1.3.1  The  Maximum  Entropy  with  Identity  Covariance  (ME/I)  Hy¬ 
pothesis.  The  MMFA  described  in  Section  1.2.1  was  developed  based  on  the  as¬ 
sumption  that  one  of  the  filters  in  the  bank  is  based  upon  an  accurate  model  of  the 
system.  The  probabilities  calculated  by  Equation  (1)  would  be  true  probabilities  if 
this  was  so.  The  residuals  of  the  correct  filter  would  follow  a  Gaussian  distribution 
with  covariance  M*',  given  by  Equation  (3).  A  Gaussian  distribution  is  the  max¬ 
imum  entropy  distribution  for  the  given  covariance  Af*,  so  a  filter  based  on  this 
Iiypv.>thesls  is  called  an  ME/M  filter  [25]. 

The  measurement  conditional  density  function  for  the  filter  is  calculated 
as  follows; 


/z{t,)]a,Z(ti_i)(Zi  I  a^,Z,_i)  = 


(56) 

Since  we  are  approximating  a  system  having  an  uncountable  infinity  of  possible 
parameter  values,  with  an  MMFA  consisting  of  a  finite  number  of  filters,  the  correct 
model  is  almost  surely  different  from  any  filter  in  the  MMFA.  Hence,  the  probabilities 
calculated  are  pseudo-probabilities,  and  the  residuals  almost  surely  have  a  covariance 
different  from  any  AT*. 


If  we  assume  instead  that  the  residuals  are  Gaussian  with  a  covariance  matrix 
equal  to  the  identity  matrix,  we  are  assuming  the  residuals  follow  a  “maximally 
noncommittal  residual  distribution”  [25,  page  26].  Making  this  aissumption  would 
lead  us  to  use  a  Maximum  Entropy  with  Identity  covariance  or  ME/I  filter.  The 
measurement  conditional  density  function  for  the  filter  is  then  calculated  as 
follows: 


/z(t,)|a,Z(t,_i)(Zi  I  a*,Z,_i)  — 


The  MMFA  selects  the  filter  with  the  minimum  proximity  given  by 


=  log,  I  I  1  -|-tr[ J-‘iV*]  =  tr[N*] 


(58) 


where  iV*  is  given  by  Equation  (53).  Thus  the  filter  with  the  lowest  residual  auto¬ 
correlation  is  chosen. 
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We  can  use  this  property  of  the  ME/I  filter  to  our  advantage.  Since  the  hy¬ 
pothesis  conditional  probabilities  are  calculated  apart  from  the  individual  filters,  we 
can  use  the  ME/I  hypothesis  to  update  the  probabilities  without  changing  the  gain 
in  the  estimating  filters.  In  other  words,  the  calculations  internal  to  each  elemental 
filter  assume  that  the  covariance  of  the  residuals  in  that  filter  are  well  modelled  as 
whereas  the  decision  logic  external  to  each  elemental  filter  assumes  instead 
that  this  covariance  is  the  identity  matrix.  Thus,  an  MMFA  using  the  ME/I  hy¬ 
pothesis  selects  the  ME/M  filter  with  the  lowest  residual  error  autocorrelation.  By 
replacing  Equation  (54)  with  Equation  (58),  the  optimization  algorithm  developed 
in  Section  2.1.2  can  be  u«ed  to  optimize  such  a  system. 

2. 1.3. 2  Alternative  Cost  Functionals.  It  is  assumed,  in  the  development 
of  Section  2.1,  that  the  design  objective  is  to  minimize  the  average  mean  square  es¬ 
timation  error.  Many  system  specifications  include  upper  bounds  on  the  acceptable 
error.  To  address  these  specifications,  a  maximum-type  cost  functional  is  appropri¬ 
ate.  Such  a  cost  functional  is  the  maximum  mean  squared  error 

J~  =  maxtr[W®]  (59) 

It  is  easy  to  see  where  problems  may  occur  using  this  functional  in  a  numerical 
optimization  problem.  The  maximum  error  autocorrelation  is  likely  to  occur  between 
two  filters,  or  between  a  filter  and  a  boundary.  This  would  allow  us  to  move  the 
other  filter  parameters  without  affecting  the  cost,  making  the  derivative  of  the  cost 
in  a  gradient  optimization  algorithm  equal  to  zero  in  the  those  directions.  The 
optimization  algorithm  would  assume  that  the  cost  functional  was  minimized  in 
those  directions,  and  ignore  the  corresponding  values.  Hence,  numerical  optimization 
routines  would  experience  problems  due  to  the  flatness  of  the  cost  functional  surface 
over  the  parameter  space. 

To  preclude  this  minimization  problem,  we  propose  the  hybrid  cost  functional 

J  =  AJ2  -I-  (1  -  A)J~  (60) 

where  A  is  allowed  to  vary  between  a  small  c  and  1.  This  cost  functional  maintains 
the  characteristics  of  the  basic  cost  functional  while  placing  some  weight  on  the 
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maximum  prediction  error  autocorrelation.  It  is  a  simple  matter  to  keep  track  of  the 
maximum  error  autocorrelation  found  during  the  calculation  of  J^,  then  compute 
J  from  Equation  (60).  This  cost  functional  is  explored  in  Section  3. 1.5.3,  and  is 
shown  to  provide  an  MMFA  with  lower  maximum  mean  square  error  than  the  basic 
algorithm  at  the  cost  of  a  higher  average  mean  square  error. 

2. 1.3. 3  Avoiding  Estimator  Lockup.  Recall  that  the  conditional  densi¬ 
ties  are  used  to  update  the  probabilities  at  each  sample  instant  by  the  calculation 
[53]: 

\  /z(t.)|a,Z(ti_i)(z«-  I  a*, 

2^J=1 /z(«,)|a,Z(f,_i)(2i  1  a  ,  Zj_i 

Baram  proved  [4,  Thm.5.2]  that  the  MMFA  will  converge  to  the  filter  selected  by 
Equation  (54).  This  is  the  basis  for  assuming  that  the  probability  of  the  selected  filter 
is  one.  Once  the  probability  of  the  selected  filter  reaches  one,  the  other  probabilities 
are  zero  and  the  probability  calculation  will  experience  lockup.  One  can  see  from 
Equation  (61)  that  if  p*(f,_i)  =  0,  then  =  0. 

In  practice,  the  designer  will  select  a  lower  bound  for  the  probabilities  to  keep 
the  system  open  for  adaptation  if  the  true  parameter  changes.  Let  Pmin  denote 
the  lower  bound  on  the  probabilities.  Whereeis  it  is  assumed,  in  the  development 
of  Section  2. 1.1.1,  that  the  estimator  has  converged  and  the  selected  filter  has  a 
probability  equal  to  one,  the  state  estimate  actually  converges  to 

x(<n  =  (62) 

fc=i 

=  {[1  -  (A:  X  Pmin)]x*''(tf)}  +  X*(<,^)Pmin  (63) 

*=1 

where  sel  denotes  the  selected  filter  per  Equation  (55). 

The  calculations  must  be  modified  to  take  this  into  account.  If  we  construct 
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the  augmented  state  vector, 


f  *■(<,-) 


1.  x-((-)  J 

the  estimation  error  ran  be  defined  as 

*(<r)  =  *(<r)-rx-((-) 

k=l 

=  •••  -T]xa(<r) 

Defining  the  probability  matrix  as 

A,  =  [p'I„.  p‘I„.  -T] 

the  value  of  E{[x”]^W[x~]}  can  be  calculated  by 

E{[x-rw[x-]}  =  E{tr[[x-|Tw(x-)]) 

=  E{lriW(x-J[x"l^j} 

=  E(tr|W[A,x;l|A,x.-)'fl) 
=  E{trlW[A,x.-l(x:rAjl} 
=  trlWA,E{lx:llx;)'^)Aj) 


To  calculate  the  augmented  autocorrelation  matrix,  E{[x^][x„]^} 
struct  the  augmented  system,  from  Equation  (25),  as  follows: 


2.2  Optimizing  for  Parameter  Estimation 

One  of  the  desired  estimator  structures  discussed  in  Chapter  I  is  that  of  a 
system  where  an  estimate  of  the  parameter  is  needed  without  the  requirement  for 
a  good  state  estimate.  Figure  9  (which  is  a  repeat  of  Figure  3)  illustrates  such  a 
system.  A  procedure  analogous  to  that  of  Section  2.1.2  is  used  to  find  a  representative 
parameter  set  which  provides  the  best  parameter  estimation. 
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Figure  9.  Parameter  Estimator  Structure 

2.2.1  Parameter  Estimator  Design  Algorithm.  For  the  best  parameter  esti¬ 
mation  performance,  it  is  desirable  to  minimize  the  cost  functional 

,A/^E{[a-alTw(a-a))da 

In  analogy  with  the  discussion  below  Equation  (49),  it  is  assumed  that  the  estimator 
converges  to  one  particular  filter  so  that  p*®*  =  1,  and  all  other  p*  are  zero,  where 
sel  denotes  the  filter  selected  at  any  particular  value  of  the  parameter  vector.  The 
equations  governing  which  filter  is  selected  are  developed  in  Section  2. 1.1. 2.  The 
case  where  p*®^  <  1  is  explored  in  Sections  2.2.2  and  3.2.3. 

This  means  the  parameter  estimate,  a,  is  a*®\  where  a®®*  denotes  the  parameter 
vector  on  which  the  selected  filter  is  based.  The  cost  functional  becomes 

,2  A  U  E{(a  -  a“']'''W(a  -  a’''])da 

IJdi  <'=> 

Note  that  different  filters  may  be  selected  at  different  points  in  the  parameter  space. 

The  following  is  a  procedure  by  which  this  cost  functional  can  be  numerically 
approximated  and  minimized. 
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1. )  Start  by  describing  the  system  in  terms  of  the  parameter  vector  a,  spec¬ 
ifying  the  structure  of  the  truth  system  and  the  filters  to  be  implemented  in  the 
estimator. 

2. )  Choose  K,  the  number  of  filters  to  be  implemented  in  the  estimator. 

3. )  Choose  a  representative  parameter  set  to  begin  the  minimization.  A  rea¬ 
sonable  choice  may  be  made  by  equally  dividing  the  admissible  parameter  region 
and  choosing  the  midpoint  of  each  region  as  the  representative  parameter  set. 

4. )  Choose  a  numerical  integration  technique  which  will  be  used  to  evaluate 
Equation  (75)  for  any  given  choice  of  representative  parameter  set.  For  parameter 
minimization  it  is  not  necessary  to  evaluate  Equation  (48)  per  se.  However,  it  is 
still  necessary  to  evaluate  Equation  (47)  to  get  the  information  required  to  find  out 
which  filter  is  selected  using  Equation  (55).  The  value  of  a  needed  in  Equation  (74) 
is  the  value  of  a  forming  the  basis  of  the  filter  selected  at  the  evaluation  point. 

5. )  Step  4  generates  a  numerical  approximation  to  the  value  J(a)  for  a  prespec¬ 
ified  choice  of  parameter  vector  a.  The  problem  then  becomes  a  vector  minimization 
problem.  Choose  a  numerical  vector  minimization  technique  and  proceed  to  mini¬ 
mize  J(a)  using  the  procedure  in  step  4  to  evaluate  the  functional  for  each  parameter 
vector. 

As  is  the  case  with  the  state  estimator,  the  extended  trapezoidal  rule  [51,  Eqn. 
(4.1.11)]  is  the  numerical  integration  technique  used  in  the  parameter  optimization 
example  of  Chapter  III.  The  vector  minimization  is  done  using  a  Broyden-Fletcher- 
Goldfarb-Shanno  (BFGS)  variable  metric  method  for  unconstrained  minimization 
[62]  using  one-sided  difference  approximations. 

2.2.2  Modifications  to  the  Parameter  Estimator  Design  Algorithm.  The  pa¬ 
rameter  estimator  optimization  algorithm  embodies  most  of  the  same  calculations 
required  in  the  state  estimator  optimization  algorithm.  The  only  difference  is  that 
tr[W^^*^“]  is  replaced  by  E{[a  —  a-'rw[a  -  a"'  ]}  as  explained  in  step  4  above. 
The  algorithm  is  therefore  general  in  nature  and  the  modifications  to  parameter 
optimization  algorithm  are  the  same  as  the  modifications  to  the  state  estimator 


algorithm,  which  are  explained  in  Section  2.1.3,  with  the  exception  of  placing  lower 
hounds  on  the  probabilities.  If  pmin  denotes  the  lower  bound  on  the  probabilities, 
the  parameter  estimate  converges  to 

4(0  =  (76) 

k=i 

=  {|l-(A'xp™„)la“'(C))4l;a‘(l+)/W.  (77) 

*=1 

or,  more  concisely, 

^MMFA  ^  ^  ^  a^min  ("8) 

k=l 

We  can  modify  our  calculation  of  the  cost  functional  by  substituting  this  expression 
for  a*'*  in  Equation  (75)  and  computing  the  cost  for  any  given  lower  bound  on  the 
probability. 


2.3  Controller  Cost  Functional 

In  the  next  few  sections,  the  equations  required  to  design  an  optimal  MMCA 
system  are  developed.  The  system  of  interest  is  a  linear  regulator  as  shown  in 
Figure  10  (which  is  a  repeat  of  Figure  4).  In  an  effort  to  get  good  regulation,  it  is 
desirable  to  minimize  the  cost  functional 

(79) 

Since  it  is  assumed  that  the  admissible  parameter  set  is  constant  for  a  given 
problem,  it  is  necessary  only  to  minimize  the  numerator  of  Equation  (79).  Moreover, 

E{(yl'^Wly|}  =  E{tr|(y|7wly||} 

=  E{lr(W|ylly]7|} 

=  tr(WE{[y||y]7}) 

=  lriWE{C-[x][xl7c-7}] 

=  tclWC'*cC‘7]  (80) 


„  A  /x  E{[y|7w|yl}da 
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Figure  10.  Linear  Regulator  Using  an  MMCA 


Thus,  one  needs  to  develop  the  equations  necessary  to  compute  the  regulation  error 
autocorrelation, =  E{[x][x]^},  as  a  function  of  the  parameter  vector  a.  The 
following  section  illustrates  this  development. 

2.3.1  Derivation  of  the  Controller  Equations.  The  MMCA  consists  of  a  bank 
of  Kalman  filters,  each  in  cascade  with  a  constant  gain.  Each  Kalman  filter/ controller 
gain  combination  is  based  on  a  representative  parameter  vector  a^.  We  want  to  find 
out  what  the  regulation  error  autocorrelation  is  when  we  use  a  perturbation  model 


based  on  a*: 

x\t,^,)  =  ^'^it,^,,t,)x\ti)  +  (81) 

y\U)  =  C*(f.}x^(<.)  (82) 

z’^iU)  =  H'‘{t,)x'‘{ti)  +  v\U)  (83) 

which  varies  from  the  actual  system  with  the  truth  model  which  is  based  on  the  true 
parameter  vector  a*: 

x*(t,+J  =  -?•(«.•+„  <.)x-(i.)  +  B:(<.)u*(i.)  +  G;(<.)w*(<.)  (84) 

y*(f,)  =  C-(f.)x*(t.)  (85) 
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(86) 


and  where 

x*(^) 

x*(<.) 

u*(f.) 

U*(<i) 


B:it,) 


w3(^) 


is  the  n*  dimensional  state  vector  in  the  estimator  filter  models. 

is  the  n*  dimensional  error  state  vector  in  the  truth  model;  n* 
need  not  be  the  same  as  n*. 

is  the  n*  x  n*'  state  transition  matrix  for  the  filter  model. 

is  the  n*  x  n"  state  transition  matrix  in  the  truth  system  model. 

is  thep  dimensional  input  vector  generated  by  the  con¬ 
troller. 

is  the  p  dimensional  input  vector  which  is  calculated  by 

u-((,)  S  I;  u‘((,)p‘(<,)  (87) 

Ar=l 

is  the  n*  X  p  input  matrix  for  the  model, 
is  the  n*  X  p  truth  system  input  matrix. 


is  an  s*-vector-valued  discrete-time  white  Gaussian  noise 
cess  with  zero  mean  and  covariance  kernel 


q5(«.)  u  =  tj 


0  t,  ^ 


pro- 


is  an  s*-vector-valued  discrete-time  white  Gaussian  noise  pro¬ 
cess  with  zero  mean  and  covariance  kernel 


E{[wj(f,)][wd(f>))''’}  = 


QAU)  U  =  t, 

0  ti  A  ij 


is  the  x  s*  noise  input  matrix  for  the  filter  model. 
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Gj(i,)  is  the  n*  x  s*  noise  input  matrix  for  the  truth  model. 

y^(t,)  is  the^  dimensional  output  vector  to  be  controlled  as  modelled 
in  the  estimator  filter  model. 

y*(f,)  is  the  q  dimensional  output  vector  to  be  controlled  in  the  truth 
model. 

C*(<,)  is  the  q  X  output  matrix  for  the  model. 

C’{ti)  is  the  q  X  n*  truth  system  output  matrix. 

z*'  is  the  m  dimensional  random  measurement  vector  as  modelled 
in  the  filters. 


z*  is  the  m  dimensional  random  vector  which  represents  the  “real 
world”  measurement  from  the  truth  model. 

is  the  m  X  measurement  matrix  as  modelled  in  the  filters. 

H’{ti)  is  the  m  X  n'  measurement  matrix  in  the  truth  system  model. 

v*(f,)  is  an  m-vector-valued  discrete-time  white  Gaussian  noise  pro¬ 
cess  with  zero  mean  and  covariance  kernel 


w*  and  V*  are  assumed  to  be  independent. 

v*(fi)  is  an  m-vector-valued  discrete-time  white  Gaussian  noise  pro¬ 
cess  with  zero  mean  and  covariance  kernel 

E{(v(t,))[v((,)r}  = 


wj  and  v*  are  assumed  to  be  independent. 


R^{ti)  U  =  tj 
0  ti  ^  tj 


R'^iU)  ti  =  t, 

0  tj 
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The  development  uses  steady  state  arguments  so  the  following  assumptions  are 
made  about  time  invariance; 


U+i  —  ti  =  a,  constant  Vi 

<.)  =  #*  V< 

=  Vt 
=  Vi 
BI{U)^B:  Vi 
G^(i.)  =  G‘  Vi 
G*(i.)  =  G:  Vi 
C\ti)  =  C*  Vi 
C*(i,)  =  c*  Vi 
H*'(i.)  =  Vi 
H*(i,)  =  H*  Vi 

The  filters  are  cissumed  to  be  constant-gain  filters  (i.e.,  /T*  does  not  vary  with 
time).  The  filter  gain  may  be  calculated  in  any  fashion  that  produces  a  stable  estima¬ 
tor.  However,  to  use  this  method  effectively  for  minimizing  the  error  autocorrelation, 
an  automated  method  which  can  determine  iC*  for  any  given  parameter  is  suggested. 
For  the  example  shown  in  the  sequel,  the  steady  state  Kalman  filter  gain  is  chosen. 

We  will  similarly  assume  the  controllers  are  designed  in  any  suitable  fashion 
as  constant  gain  state  feedback  controllers  and  implemented  with  the  form 

u*(f.)  =  (88) 

Again,  to  use  this  method  effectively,  an  automated  method  which  can  determine 
G^  for  any  given  parameter  is  suggested.  For  the  example  shown  in  the  sequel,  an 
optimal  LQG  regulator  gain  [37,  Chap.  14]  is  chosen. 
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We  want  to  minimize  the  true  output  autocorrelation,  hence,  we  want  to  find 
the  autocorrelation  matrix: 

E{[yn(y‘r}  =  c-|e{(x:i[x-]T}]c-t  (89) 

The  truth  system  state  is  not  affected  by  measurement  updates  in  the  MMCA  so 
x'(fi)  =  x*(t“).  The  filter  selection  is  driven  by  the  prediction  error  so  we  will 
derive  the  equations  for  the  autocorrelation  matrix: 

E{ly-lly-|'^}  =  c-lE{(x--llxrl'f}|c-f  (90) 

As  was  done  with  the  estimator  equations,  we  derive  the  one-step  prediction  model 
from  the  standard  Kalman  filter  equations  [35,  page  275]  as  follows: 

x‘((+)  =  x‘(ir)  +  K*[Z'(<,)  -  Jif‘x‘(i-)1  (91) 

x‘(  (,■;.)=  #‘x‘((n  +  BjVlfO  (92) 

The  steady  state  development  requires  an  important  assumption  at  this  point. 
It  is  assumed  that  the  estimator  section  of  the  MMCA  has  converged  to  the  proper 
filter.  Hence, 

=  1  =>  u*(f.)  =  (93) 

where  sel  denotes  the  filter  selected  by  the  identification  process.  All  other  p-^’s  are 
zero.  As  the  functional  is  integrated  across  the  parameter  space,  different  filters 
are  selected.  The  following  development  is  for  the  filter  k  which  has  been  selected 
by  the  identification  process,  and  is  not  applicable  to  the  other  filters.  All  filters 
are  propagated  based  on  the  input  vector,  u*(<,),  not  the  input  calculated  by  the 
corresponding  elemental  controller. 

With  this  in  mind,  substituting  Equation  (88)  into  Equation  (92)  gives 

**(<-+,)  =  -  B;Gjx*(i+)  (94) 

Substituting  Equation  (91)  into  Equation  (94)  gives 

-  B5'3;){ii‘((-)  +  *r‘[H-x-((i)  +  v-((0  -  H‘x‘((-))}  (95) 
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Collecting  terms  yields 

+  (4>''  _  B'^G’^JK’^v'iti)  (96) 

The  true  system  is  modelled  by 


=‘'('r+j  =  *'^vr)  +  +  GX(/.)  (97) 

Recalling  that  u*(^j)  =  u*'(t,),  substituting  Equation  (88)  into  Equation  (97)  yields 

=  #-x-((-)  -  B;G‘Si\t*)  +  G>;(«,)  (98) 

Substituting  Equation  (91)  into  Equation  (98)  gives 

x-((-,.)  =  i>-x-((-)  -  BJG5{*‘(<r)  +  B-‘|H-x-((,)  +  v-((,)  -  H‘x‘(<-)1} 

+  G>;((i)  (99) 

Collecting  terms  yields 

x'(tr+.)  =  (#■  -  B;GjJir*H‘)x‘((-)  -  BJGj(/ -  Jf*H*)x‘((-) 

-  B;Gjjir‘v-((,)  +  G>:((,)  (loo) 


This  gives  us  the  one-step  prediction  model 


X*  ~ 

(<P*  -  B^^G^K^H*)  -B^G*(/  - 

<- 

Jk  — • 

X . 

L  •+>  J 

+ 


-B;Gjiir‘ 
(#*  -  B'iG))K'‘ 


g: 

v:  + 

a 

1 

O 

w 


di 


(101) 


The  prediction  model  of  the  tracking  error  equation  is  now  of  the  form: 


where 


+  Qo 


(^*  -  S*G*A'''/f*)  -B*G*(/  - 


(102) 
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Qo  = 


G*  -B-,GlK'‘ 
0 


and  '  is  an  s*  -f  m-vector  valued  discrete  time  white  Gaussian  noise  process 


with  zero  mean  and  covariance  kernel 


Wj-  t  t  QoUi)  ti  =  tj 
E{  [wiVl)  = 

V-  0  u  ^  tj 


Qo{U)  — 


Q:{ti)  0 
0  R*{ti) 


Since  and  v*  are  assumed  to  be  independent  random  vectors,  they  are  uncorre¬ 


lated. 


This  structure  allows  us  to  calculate  the  autocorrelation  of 


matrix  equation 


using  the 


*1+1  "i+i 


^.-T  ]  I  =  =  rSe.T^  +  QoQoG] 


(103) 


where  sel  denotes  the  filter  selected  at  any  particular  point  in  the  admissible  param¬ 
eter  set  so  that  =  1.  The  case  where  p®'*  <  1  is  discussed  in  Sections  2. 3. 3. 3  and 


3.3.5.4. 


The  upper  left  partition,  E{[x*  ][x*"]^},  of  this  matrix  is  ^c-  Letting  i  =>  oo, 
we  can  calculate  the  steady  state  value  of  E{[y*”]^W[y*~]}  using  Equation  (80). 


2.3.2  Controller  Design  Algorithm.  The  following  is  a  procedure  by  which  the 
cost  functional  of  Equation  (79)  can  be  numerically  approximated  and  minimized 
to  provide  the  representative  parameters  for  an  optimized  multiple  model  adaptive 
regulator.  The  algorithm  may  be  adapted  for  other  control  structures  such  cis  trackers 
[37,  pages  114-122]  or  perturbation  regulators  [37,  page  125]. 
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1. )  Start  uy  describing  the  system  in  terms  of  the  parameter  vector  a,  spec¬ 
ifying  the  structure  of  the  truth  system  and  the  filters  to  be  implemented  in  the 
controller.  This  includes  specifying  the  method  for  calculating  and  given  the 
parameter  vector  a. 

2. )  Choose  K,  the  number  of  regulators  to  be  implemented. 

3. )  Choose  a  representative  parameter  set  to  begin  the  minimization.  A  rea¬ 
sonable  choice  might  be  made  by  dividing  the  admissible  parameter  region  into  equal 
intervals  and  choosing  the  midpoint  of  each  interval.  The  regulator  problem  is  more 
likely  to  produce  unstable  systems  than  the  estimator  problem.  Even  though  the 
LQ  regulator  has  guaranteed  stability  margins  [37,  page  11],  and  the  Kalman  filter 
has  similar  robustness  [37,  page  99],  the  filter/controller  combination  has  no  guaran¬ 
teed  stability  margins  [9].  If  the  starting  point  results  in  an  unstable  regulator  for  a 
particular  parameter  value,  the  designer  has  the  choice  of  changing  the  rule  for  the 
controller  gain  calculations  to  make  the  regulators  more  robust,  or  moving  the  closest 
filter  towards  the  unstable  point.  It  might  also  be  desirable  to  try  the  minimization 
with  more  filters  to  gain  insight  for  systems  with  fewer  filters.  One  other  method 
that  worked  particularly  well  for  the  example  problem  of  Section  3.3.2,  was  using 
a  penalty  function  that  places  an  additional  cost  on  each  unstable  point  evaluated 
while  calculating  the  cost. 

4. )  Choose  a  numerical  integration  technique  which  will  be  used  to  evaluate 
Equation  (79).  This  integration  provides  the  value  of  the  functional  for  a  specific 
choice  of  representative  parameter  set.  Equation  (80)  defines  the  curve  of  regulation 
error  autocorrelation  as  a  function  of  the  parameter  vector.  The  information  required 
to  set  up  Equations  (101)  and  (103)  is  chosen  based  on  the  proper  filter  selection  per 
Equation  (55).  It  should  be  noted  that  the  filter  selection  is  based  entirely  on  the 
estimator  in  the  loop,  and  is  calculated  with  the  equations  developed  in  Section  2.1.1. 

5. )  Step  4  generates  a  numerical  approximation  to  the  value  J(a)  for  a  prespec¬ 
ified  choice  of  parameter  vector  a.  The  problem  then  becomes  a  vector  minimization 
problem.  Choose  a  numerical  vector  minimization  technique  and  proceed  to  mini¬ 
mize  J(a)  using  the  procedure  in  step  4  to  e"  luate  the  functional  for  each  parameter 
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vector. 


As  is  the  case  with  the  state  estimator  example,  the  extended  trapezoidal 
rule  [51,  Eqn.  (4.1.11)]  is  the  numerical  integration  technique  used  in  the  regulator 
example  of  Chapter  III.  The  vector  minimization  is  done  using  a  Broyden-Fletcher- 
Goldfarb-Shanno  (BFGS)  variable  metric  method  for  unconstrained  minimization 
(62], 


2.3.3  Modifications  to  the  Controller  Design  Algorithm.  The  development  of 
the  controller  optimization  algorithm  is  analogous  to  the  development  of  the  state 
estimator  optimization  algorithm.  So,  as  explained  in  Section  2.1.3,  the  algorithm  is 
general  enough  to  account  for  MIMO  systems  and  parameter  vectors.  Incorporating 
the  ME/I  mechanization  is  accomplished  in  the  same  manner  as  it  in  the  state 
estimator  optimization  algorithm. 

2.3.3. 1  Controller  Weighting  Matrices.  The  controller  optimization  al¬ 
gorithm  also  incorporates  a  weighting  matrix,  W,  to  allow  the  designer  to  handle 
tradeoffs  between  states.  In  addition,  the  control  problem  provides  added  flexibility 
to  vary  the  respective  weight  of  the  states.  The  controller  gains  are  calculated  to 
minimize  a  continuous  time  quadratic  cost  functional  of  the  form 

J  =  f  [x^(t)Sx(t)  +  u^(^)Uu(t)]dt  (104) 

Jo 

Note  that  different  choices  of  S  and  U  produce  different  gain  matrices,  G*.  The 
effect  of  varying  these  v/eighting  matrices  is  explored  in  Section  3.3.5. 1,  where  these 
matrices  are  chosen  to  be  consistent  with  the  choice  of  W.  Maintaining  this  consis¬ 
tency  is  not  necessarily  required  for  the  algorithm  to  work  properly. 

2. 3. 3.2  An  Alternative  Maximum-Type  Cost  Functional.  As  shown  in 
Section  2. 1.3. 2,  a  maximum- type  cost  may  be  added  to  the  basic  quadratic  cost 
functional,  allowing  the  designer  to  trade  off  between  average  mean  square  regula¬ 
tion  error  and  maximum  mean  square  regulation  error.  Many  times,  however,  the 
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maximum  allowable  mean  square  regulation  error  is  set  to  a  specific  value,  i.e., 

max  'I'c  =  n>ax  <  Spec 
a€.^ 

The  basic  controller  optimization  algorithm  can  be  modified  to  attack  this  type  of 
specification  by  adding  a  penalty  function,  (j?  ,  to  the  quadratic  cost  so  that 

J  =  Jg  +  (105) 

The  penalty  function,  (,5,  is  a  function  of  the  maximum  regulation  error  autocorrela¬ 
tion  as  shown  in  Figure  11. 


Maximum  Regulation  Error  Autocorrelation,  max 
Figure  11.  Penalty  Function  for  Maximum  Regulation  Error  Autocorrelation 

The  penalty  function  adds  a  quantum  cost  if  'fc  max  is  above  the  specification 
and  increcises  linearly  as  max  increeises.  The  quantum  cost  should  be  large  enough 
so  that  it  overwhelms  the  quadratic  component  of  the  cost  functional  when  max 
exceeds  the  specifications,  thus,  prohibiting  tradeoff  between  the  penalty  function 
^  and  the  quadratic  cost  J*.  The  linear  region  gives  the  optimization  algorithm 
information  concerning  which  direction  to  go  in  order  to  decrease  the  cost.  This 
method  is  used  in  Section  3. 3. 5. 3. 

2. 3. 3. 3  Avoiding  Controller  Lockup.  The  development  in  Section  2.3.1 
was  based  upon  the  assumption  that  the  MMCA  had  converged  to  the  selected 
controller  with  probability  1.  It  has  been  shown  in  Section  2. 1.3. 3  that  it  is  reasonable 
to  place  lower  bounds  on  the  probabilities  to  keep  the  estimator  section  of  the  MMCA 
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from  experiencing  lockup.  The  controller  cost  functional  calculations  can  be  modified 
to  take  this  into  account.  To  do  so,  we  must  derive  the  one-step  prediction  model 
from  the  standard  Kalman  filter  equations  [35,  page  275]  as  follows: 


=  x*(r)  +  K'‘[x’{t,)  - 


(106) 

(107) 


At  this  point  in  the  development  of  Section  2.3.1,  it  was  assumed  that  the 
estimator  section  of  the  MMCA  has  converged  to  the  proper  filter.  Instead,  we 
assume  the  input  is  the  probabilistically  weighted  average: 


K 


and  each  input  is  of  the  form 


j=i 


u'((0  = 


(108) 


(109) 


With  this  in  mind,  substituting  Equations  (108)  and  (109)  into  Equation  (107)  gives 

K 


>=» 

=  ii‘x‘(!f)-B;f;p'G;x'((+) 

>=i 

Substituting  Equation  (106)  into  Equation  (111)  gives 


(110) 

(111) 


*‘(*r+.)  =  4>‘{*‘((.-)  +  i>r‘K((0-H‘x‘((,-)l} 


4*  r, 


K 


-  Y.  p'G;{*'(<,-)  +  K'lz-((,)  -  H'x'(|-)1}  (112) 


=  ^*{x‘((r)  +  if'|H-x-((,)  +  v-(;,)  -H‘x'(<-)l} 


\k  r~kri- 


k^kf  1-' 


K 


-B^Yp'Gix'U-) 

-B*  Y  +  v-((i)  -  H‘x>(l-)\ 


(113) 
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Rearrange  and  collect  terms  to  get 


i=i 

-  E  p^aiK>H-x-M 


-  b;  E  P^GiKWV.) 


(114) 


x‘(i- )  = 


-  Bj  E  P'G'C/  - 

J=1 

(  ^  \ 

+  I  ^  x’(ti) 


+  -  Bj  x;  v*(f,) 


The  true  system  is  modelled  by 


(115) 


X‘(r.  )  =  ^-xV-)  +  B>*(<.)  +  G>*(<.) 


(116) 


Substituting  Equations  (108)  and  (109)  into  Equation  (115)  yields 


xvz.,)  =  *-x-(t-)  +  B;£p^u’(t,)  +  a;w;(i,) 


=  *-x-(i-)  -  b;  E 


(117) 


Substitute  Equation  (106)  into  Equation  (117)  to  get 


x'(<.:;j  =  ^^xVn 


+  G>:(t.) 


(118) 
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-  B;  |:p'<3;{x'((-)  +  K’lH-x'M  +  V((.)  -  H’x^t;)l} 

+  G>;(iO  (119) 


J  =  1 

-B;£p>G>Jl-K>Hnx>(l-) 

j=l 

-  b:  E  p‘GiK‘x-{t,)  +  g;w;((,)  (120) 

J=1 

This  gives  us  the  one-step  prediction  model  of  the  form: 


■^1+1 

=  T, 

xr 

-|-  Qa 

(121) 

-  K  - 
X . 

«  +  i  J 

x> 

... 

1 

where 

-  B-  P^GiK^H-)  -K'H') 

{^  K  H‘  -  ^  -  KH)  -  B'jp'G^H  -  K  W 

fG{K^H*)  -B^p^G^I  -  iiT'if*) 

-B-p^G^il  -  K^H^) 

-B'^p^G^il  -  K^H^) 


...  ^f<{I  -  -  B^P^G^{I  -  K^H^) 

0  -B^,Ef=.P^GiK^) 

0  Ef=,  P^G^K^) 
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(122) 

The  upper  left  partition,  E{[x*~][x*']^},  of  this  matrix  is  denoted 
E{[yn"^w[yr  ■]}  can  then  be  calculated  using  Equation  (80). 

Using  Equation  (122)  instead  of  Equation  (103)  allows  us  to  calculate  the  cost 
of  a  sys.em  for  any  given  lower  bound  on  the  probability.  Note  that  if  the  lower 
bound  is  zero,  Equation  (103)  reduces  to  Equation  (122). 

2.4  Summary 

The  development  of  the  basic  optimization  algorithms  for  MMFA  and  MMCA 
design  has  been  presented  in  this  chapter.  These  algorithms  minimize  the  prediction 
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error  autocorrelation  at  t~  with  the  anticipation  that  the  error  autocorrelation  at 
is  also  minimized.  The  algorithms  as  presented  are  general  in  nature  and  applicable 
for  any  finite  system.  The  only  limitation  to  practical  design  is  the  amount  of 
computing  power  av'ailable  to  perform  the  calculations  required  for  the  algorithm.  A 
weighting  matrix  in  the  cost  functional  allows  the  designer  to  vary  the  importance 
of  individual  states  or  parameters. 

Several  possible  modifications  to  the  basic  MMFA  and  MMCA  structure  have 
been  proposed  which  affect  the  performance  of  the  final  systems.  These  include 
changing  the  hypothesis  conditional  probability  calculation,  using  alternative  cost 
functionals,  and  placing  lower  bounds  on  the  pseudo-probabilities.  The  develop¬ 
ment  of  the  associated  modifications  to  the  optimization  algorithms  have  also  been 
presented. 

An  example  is  presented  in  Chapter  III  to  illustrate  the  benefits  of  using  these 
algorithms;  in  part,  to  analyze  multiple  model  adaptive  estimation  and  control  sys¬ 
tems;  and  as  a  whole,  to  design  MMFAs  and  MMCAs  to  provide  minimum  mean 
square  errors.  The  modifications  discussed  above  are  also  evaluated  in  Chapter  III. 
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III.  Evaluation 


The  system  design  methods  presented  in  the  previous  chapter  seem  to  be  sound. 
However,  with  any  design  method  it  is  necessary  to  have  some  quantitative  informa¬ 
tion  pertaining  to  the  value  of  the  method  for  “real  world”  design.  It  is  impossible 
to  provide  a  numerical  value  representing  the  increase  in  performance  over  other  de¬ 
sign  methods  without  first  knowing  the  specifics  about  the  problem  at  hand.  A  very 
simple  example  is  presented  in  this  chapter  to  help  the  reader  evaluate  the  merits  of 
this  design  method.  The  example  problem  is  described  in  Section  3.1  along  with  the 
MMFA  evaluation.  The  same  problem  is  used  to  evaluate  the  parameter  estimation 
performance  in  Section  3.2,  and  the  MMCA  regulator  performance  in  Section  3.3. 

3. 1  State  Estimator  Evaluation 

An  example  is  shown  here  to  demonstrate  how  useful  insights  can  be  gained 
from  approaching  the  design  of  an  MMFA  through  minimizing  the  prediction  er¬ 
ror.  A  variety  of  previously  developed  methods  are  used  to  design  estimators,  and 
their  associated  cost  and  parameter  discretization  are  compared  to  the  cost  and  dis¬ 
cretization  of  an  estimator  which  minimizes  the  cost  functional.  Equation  (9).  The 
exact  procedure  used  to  calculate  the  functional  and  to  find  a  minimizing  set  of 
representative  filters  is  explained  in  Section  3.1.2. 

3.1.1  System  Description.  The  true  system  is  an  ideal  mechanical  transla¬ 
tional  system  as  shown  in  Figure  12.  It  is  a  continuous  time  system  which  is  modelled 
by  the  second  order  linear  equation: 


x(<)=F(t)  x{t)+B{t)  u(t) 


(123) 


Figure  12.  Ideal  Mechanical  Translational  System 


ii(t)  0  1  xAt)  0 

L  J  L  J  L  J  L  i  J 

where  ii  is  the  position  in  meters  and  X2  is  the  velocity  in  meters  per  second.  This 

problem  can  be  restated  in  a  more  general  form  by  letting  denote  the  undamped 


natural  frequency,  and  letting  C  denote  the  damping  ratio, 


6 

2-y/km' 


To  frame  the  system  in  a  stochastic  setting,  a  dynamic  driving  noise  term  is 
added  to  the  model  to  produce 


x(<)  =  F(<)x(<)  +  B(<)u(t)  +  G(t)w(t) 


0  1 


-ul  -2Cu;„  X2(0 


Xi(t)  0  0 

+  u{t)  +  w{t) 


(125) 


(126) 


E{t/;(t)i£;(<  +  r)}  =  Q6{t) 


Q  =  0.01 


The  damping  ratio  is  known  to  be^  =  0.01  and  the  natural  frequency,u;„  , 
is  allowed  to  vary  from  2?r  (~  6.28)  to  207r  (~  62.83)  radians  per  second.  These 
values  of  Q,  (^,  and  were  chosen  to  accentuate  the  differences  between  models 


56 


by  increcising  the  sensitivity  to  change  in  natural  frequency.  Larger  damping  ratios 
such  as  i;  =  0.707  result  in  a  system  where  there  is  less  distinguishable  difference 
between  models,  so  a  multiple  model  technique  may  not  be  warranted  at  all.  Q  was 
chosen  to  provide  enough  system  energy  for  convergence  of  the  adaptation  process, 
yet  not  so  much  as  to  swamp  out  the  system  with  noise  effects.  The  values  chosen 
may  be  associated  with  a  large  space  structure  and  are  not  unrealistic  [23,  page  44]. 
Any  system  with  a  high  sensitivity  to  parameter  variation  may  be  a  candidate  for 
an  MMFA. 

The  system  is  implemented  with  an  equivalent  discrete  time  system  model  of 
the  form  [35,  page  171]: 

+  Wi(<,)  (127) 

where 

L+i  ~  L'  =  0-01  sec  V  i 

=  the  state  transition  matrix  associated  with  F(f) 

Bd{ti)  =  /  ^(fi+,,r)B(r)dr 

and  Wd{ti)  is  discrete-time  white  Gaussian  noise  process  with  zero  mean  and  covari¬ 
ance  kernel 

E{[w<i(f.)][w<i(fj)]''’}  =  - 

0  U  ^  t, 

where 

Jt, 

Noisy  measurements  are  available  at  each  sample  instant  described  by  the  equation 

2(f.)  =  H(<.)x(<.) -f  u(f.)  (128) 

z{t,)  =  [\  0]  +v{t,)  (129) 

X2(t) 
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where  v{ti)  is  a  zero-mean  white  Gaussian  noise  process  with  covariance  kernel 


and 


R{ti)  i  =  j 

< 

0  i^j 


R{U)  -  0.01  V  i 


This  is  the  same  problem  Hentz  studied,  except  it  is  simplified  for  illustration 
by  not  allowing  C  to  vary  [19,  pages  16-lP],  [39]. 


3.1.2  Filter  Selection  Process.  This  section  describes  in  detail  the  steps  in¬ 
volved  in  designing  an  MMFA  for  the  example  problem.  The  main  point  of  this 
section  is  to  show  how  useful  insights  can  be  gained  by  attacking  an  estimator  de¬ 
sign  problem  through  minimizing  the  average  prediction  error  autocorrelation. 

The  first  step  in  any  design  is  to  find  out  what  the  optimal  or  best  result 
looks  like.  The  optimal  estimation  error  autocorrelation  corresponds  to  an  MMFA 
consisting  of  an  uncountably  infinite  number  of  filters,  each  designed  with  at  a 
different  point  in  the  interval  [27r,207r]  and  all  other  values  equal  to  the  truth  system 
values  [36,  pages  129-130).  One  can  approximate  this  solution  by  calculating  the 
cost  functional  for  a  fine  discretization  of  the  parameter  space  assuming  that  there 
is  a  filter  designed  at  each  evaluated  point.  This  involves  solving  Equation  (47)  at 
a  sufficient  number  of  points  in  the  interval  to  produce  a  smooth  curve.  Figure  13 
is  a  plot  of  the  trace  of  the  minimum  achievable  steady  state  error  autocorrelation 
matrix, >  for  the  example  problem  cis  a  function  of  the  true  natural  frequency, 
using  a  discretization  of  201  equally  spaced  points. 

This  plot  of  the  optimal  estimation  error  provides  valuable  information  about 
the  performance  one  can  expect  from  an  MMFA.  The  prediction  error  covariance  of 
an  implementable  MMFA  will  always  be  above  this  curve.  The  shape  of  the  curve 
also  provides  information  for  a  first  guess  at  the  optimized  design.  As  shown  in  the 
next  section,  the  filter  parameters  should  be  placed  closer  together  at  higher  values 
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Figure  13.  Optimal  Estimation  Error  Autocorrelation  for  Example  Problem 


of  tr^o,  because  the  system  is  more  sensitive  to  change  in  the  parameter  at  Iiigher 
values  of  tr^g.  The  closer  the  starting  point  is  to  the  optimal  design,  the  fewer 
evaluations  will  be  necessary  to  get  within  the  desired  accuracy. 

The  value  of  the  cost  functional  for  the  example  problem,  with  W  =  I„xn 
(giving  position  and  velocity  equal  weight)  and  assuming  an  infinite  bank  of  filters  is 
used,  can  be  calculated  by  integrating  the  curve  in  Figure  13.  This  value  is  denoted 
Jo  and  is  calculated  to  be  0.04393m^  +455.5m^/s^.  However,  it  is  obvious  that  no 
one  can  build  an  estimator  with  an  infinite  number  of  filters.  Several  methods  have 
been  used  previously  to  select  K  representative  parameter  values  for  the  estimator. 
The  methods  applicable  to  this  problem  are  listed  here. 
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Method  1.  An  uninformed  first  choice  might  be  to  divide  the  parameter 
space  into  K-1  intervals  and  design  a  filter  at  each  interval 
boundary  [10,  23,  24].  (e.g.,  chosing  three  design  parameters 
from  the  interval  [0,1]  using  this  method  would  produce  the 
parameters  (0.0,0. 5, 1.0).)  Hentz  did  this  with  (  in  his  thesis 
[19,  pages  19-20]. 


0.0 


0.5 


1.0 


+ 


+ 


+ 


a 


3 


Method  2.  A  method  designed  to  cover  the  parameter  space  better  than 
method  1  is  to  divide  the  parameter  space  linearly  into  K 
intervals  and  design  a  filter  at  the  midpoint  of  each  inter¬ 
val.  (e.g.,  chosing  three  design  parameters  from  the  inter¬ 
val  [0,1]  using  this  method  would  produce  the  parameters 
(0.1667,0.5,0.8333).) 

0.0  0.167  0.5  0.833  1.0 


Method  3.  Hentz  noticed  that  frequency  response  characteristics  vary 
logarithmically  in  this  system  so  he  divided  the  ujn  space  log¬ 
arithmically  into  K-1  intervals  and  designed  a  filter  at  each 
boundary  [19,  pages  19-20].  Hentz  divided  thew^  space  into 
nine  intervals  and  designed  ten  filters  using  the  boundary 
points.  Table  1  lists  the  values  he  used  for  his  bank  of  filters. 

Method  4.  The  latest  method  to  be  used  at  AFIT  is  to  choose  a  nominal 
point  to  design  the  first  filter  and  simulate  responses  as 
the  true  parameter  value  moves  away  from  the  nominal, 
choosing  an  arbitrary  reduction  in  estimation  accuracy  to 
signal  the  point  at  which  a  new  filter  should  be  designed  (see 
[29,  Section  4.4.4]  and  [30,  34]).  For  example,  a  45%  increa.se 
in  the  value  of  E{[x(f/')]^[x(<['')]}  (refer  to  Eqn.  (29))  is  used 
in  the  example  so  that  three  filters  cover  the  region. 
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Filter 

ijjn  (rad/sec) 

1 

6.28 

2 

8.12 

3 

10.48 

4 

13.54 

5 

17.48 

6 

22.58 

29.16 

37.67 

48.65 

62.83 

Table  1.  Logarithmic  Division  of  the  Frequency  Parameter 


u)n  (rad/ 

sec) 

Filter 

Method  1 

Method  2 

Method  3 

Method  4 

By  Optimization 

1 

6.28 

15.71 

6.28 

MSSM 

25.89 

2 

34.56 

34.56 

19.86 

41.61 

3 

62.83 

53.41 

62.83 

56.01 

Table  2.  Parameter  Choices  for  Various  Methods 

For  purposes  of  illustration,  three  filters  are  used  to  construct  an  MMFA  for 
this  example  problem.  The  values  of  Wn  chosen  using  each  of  the  four  methods  above 
are  shown  in  Table  2.  The  weighting  matrix  W  is  chosen  as  the  identity  matrix,  I, 
and  the  truth  model  is  the  same  dimension  as  the  filter  models  (T  =  I). 

Also  shown  in  Table  2  are  the  values  of  u;„  chosen  by  the  numerical  optimiza¬ 
tion  described  in  Section  2.1.2.  For  this  optimization,  it  is  eissumed  that  the  filters 
implemented  are  full  state,  constant  gain  filters.  The  gain  used  in  the  filters  is  the 
steady  state  Kalman  filter  gain  a.ssuming  the  correct  values  of  Qd  and  R. 

Methods  one  through  three  are  geometric  in  the  parameter  space  and  require 
no  further  discussion.  Method  four  requires  a  trial  and  error  approach,  calculating 
the  steady  state  autocorrelation  of  x(tf)  as  a  function  of  a  for  a  “nominal”  filter 
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and  using  a  predesignated  increase  in  the  error  autocorrelation  (e.g.  45%)  to  signal 
the  point  at  which  another  filter  is  required.  It  was  found  that  =  50.00  is  a  good 
nominal  value  for  this  problem.  At  values  of  <  50.00,  the  error  autocorrelation 
never  exceeds  145%  of  the  value  at  =  50.00.  As  shown  in  Figure  14,  for  a;„  > 
50.00,  the  value  of  the  error  autocorrelation  reaches  145%  of  the  nominal  value 
around  =  54.00.  Taking  =  54.00  as  the  nominal,  the  value  of  the  error 
autocorrelation  reaches  145%  of  this  nominal  around  tx>n  =  58.00,  as  is  shown  in 
Figure  15. 

3.1.3  Preliminary  Analysis.  A  preliminary  analysis  is  accomplished  by  cal¬ 
culating  the  cost  functional,  of  Equation(49),  for  each  choice  of  parameters.  Plots 
of  the  steady  state  prediction  error  autocorrelation  are  shown  in  Figures  16  -  20  as  a 
function  of  the  truth  system  parameter  for  each  of  the  design  methods.  The  optimal 
prediction  error  autocorrelation,  which  would  require  a  filter  at  every  value  of  a, 
is  also  shown  to  illustrate  how  the  estimation  degrades  when  the  true  parameter  is 
far  from  the  parameters  assumed  within  the  filters.  Triangles  on  the  frequency  axis 
denote  the  filter  parameter  values. 

The  discontinuities  in  the  curves  show  the  points  in  the  parameter  space  where 
the  MMFA  switches  filters.  It  is  interesting  to  note  that,  cis  the  true  parameter 
increases,  the  MMFA  switches  to  a  filter  with  higher  prediction  error  autocorrelation. 
Since  our  objective  is  to  find  a  system  with  minimum  estimation  error,  this  is  to  our 
detriment.  A  maximum  entropy  with  unit  covariance  (ME/I)  estimator  [25]  is  one 
step  closer  to  our  objective,  as  the  filter  selected  in  the  ME/I  estimator  is  the  one 
with  the  lowest  residual  autocorrelation.  This  is  discussed  further  in  Section  3. 1.5. 2. 

The  cost  functionals  are  calculated  by  integrating  the  plots  in  Figures  16  -  20, 
and  these  are  tabulated  in  Table  3  for  each  of  the  design  methods.  To  compare  the 
preliminary  results,  we  use  method  1  as  the  baseline  and  make  relative  comparisons. 
It  is  evident  from  Figure  16  that  placing  the  filters  by  method  1  puts  the  most 
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Figure  14.  ^{if)  Error  Autocorrelation  for  Method  4,  wnom  =  50.00 


Figure  15.  x(t/’)  Error  Autocorrelation  for  Method  4,  wnom  =  54.00 


6.'! 


Estimation  Error  Autocorrelation  Estimation  Error  Autocorrelation 


Figure  17.  x(i,-  )  Autocorrelation  Using  Method  2 


Figure  20.  x(i,  )  Autocorrelation  Using  Basic  Optimization  Method 


Design 

Method 

(rn^) 

(m^/s'*) 

Method  1 

n.M08  + 

1876 

Method  2 

0.084. a  + 

986 

Method  3 

0.2692  + 

4283 

Method  4 

0.1810  T 

2482 

Optimization 

0.0742  + 

797 

Table  3.  Cost  Functional  Evaluation  for  Various  Methods 


parameter  space  between  filters.  Since  the  estimation  degrades  as  the  true  parameter 
moves  away  from  a  representative  parameter,  it  seems  reasonable  to  limit  the  space 
between  filters.  This  thinking  inspired  method  2,  and  as  shown  in  Figure  17,  this 
method  reduces  the  degradation  in  estimation  error.  As  shown  in  Table  3,  going  from 
method  1  to  method  2  cuts  the  cost  functional  in  half.  For  this  problem,  method  2 
is  the  best  of  the  geometric  methods. 

The  re'^uits  for  method  3  illustrate  how  improper  reasoning  can  hurt  a  design, 
it  seemed  reasonable  at  the  time  for  Hentz  [19,  39]  to  choose  a  logarithmic  division 
of  the  frequency  range  for  his  design.  .4s  shown  in  Table  3,  this  provides  a  cost 
functional  over  four  times  that  of  method  2.  It  is  evident  from  these  figures  that  it 
is  better  to  place  the  filters  closer  together  at  higher  frequencies. 

.Method  4  wais  developed  in  an  attempt  to  place  the  filters  on  the  basis  of  es¬ 
timator  performance  instead  of  geometrically  in  the  parameter  space.  One  problem 
with  this  method  is  that  the  filter  selection  is  not  taken  into  account,  it  is  evi¬ 
dent  from  Figure  19  that  the  .MMFA  switches  filters  somewhere  between  the  two 
representative  parameters  so  that  the  45%  increase  in  error  autocorrelation  is  not 
experienced  between  the  filters,  but  it  is  reached  on  the  ends  of  the  region.  Even  if 
the  switching  were  taken  into  account,  this  design  is  based  on  the  error  at  and 
the  filter  selection  is  based  upon  the  prediction  error  (at  t,~).  This  would  require 
calculating  error  autocorrelation  both  at  t~  and  tf .  It  would  be  mere  reasonable  to 
use  this  method  with  data  at  t~  as  it  show's  more  dependency  on  model  adequacy. 
Method  4  is  the  most  difficult  by  far,  as  it  involves  a  great  deal  of  trial  and  error 
to  get  the  parameter  region  covered  with  a  prespecified  number  of  filters.  Both 
the  nominal  parameter  value  and  the  amount  of  degradation  allowed  between  filter 
parameters  are  chosen  by  the  designer. 

The  set  of  parameters  found  by  the  optimization  algorithm  produces  a  sys¬ 
tem  with  a  cost  functional  around  19.2%  lower  than  the  nearest  competitor.  This 
would  lead  us  to  believe  that  an  .MMFA  implemented  with  the  optimal  discretization 
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Figure  21.  MMAE  Simulation  Block  Diagram 

would  provide  an  estimate  with  the  lowest  mean  square  prediction  error  on  aver¬ 
age.  Moreover,  the  optimized  system  should  exhibit  mean  square  prediction  errors 
approximately  19.2%  lower  than  those  exhibited  by  the  method  2  design.  To  test 
this  conjecture,  a  simulation  of  the  different  estimators  is  in  order. 

3.1.4  Estimator  Simulation.  To  evaluate  the  parameter  discretization  found 
by  the  optimization  algorithm  described  in  Chapter  II,  a  Monte  Carlo  simulation 
was  performed.  A  block  diagram  of  the  computer  simulation  is  shown  in  Figure  21. 
Equation  (127)  was  used  to  simulate  the  truth  system  with  no  input  (u*(<,)  =  0),  and 
it  was  driven  by  a  simulated  discrete  time  white  noise  process  that  was  the  same  for 
each  estimator.  The  truth  model  was  the  same  structure  as  the  filter  models,  and  the 
only  difference  was  the  value  of  so  the  transformation  matrix  T  was  the  identity 
matrix.  Each  MMFA  was  tested  by  running  20  simulations  at  each  of  200  values 
of  ijjn  (a*).  Each  simulation  was  ten  seconds  in  duration  and  remained  constant 
throughout  the  simulation.  Equation  (1)  was  used  to  propagate  the  probabilities 
forward  from  an  a  priori  probability  distribution  of  p%fo)  =  P^(^o)  =  P^(fo)  =  1/3, 
and  there  was  no  lover  bound  on  the  probabilities. 
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Quantity  of 
Interest 

Design  Method 

Meth.  1 

Meth.  2 

Meth.  4 

Opt. 

MS  Position 
Prediction  Error 
(meters^) 

0.1348 

0.0837 

0.2590 

n 

0.1756 

0.0731 

MS  Velocity 
Prediction  Error 
( meters^ /sec^) 

1792 

982 

4101 

MS  Parameter 
Estimation  Error 
(rad^/sec^) 

138.8 

39.4 

265.3 

495.8 

59.6 

Table  4.  Monte  Carlo  Simulation  x(<,  )  Results 


Quantity  of 
Interest 

Design  Method 

Meth.  1 

Meth.  2 

Meth.  3 

Meth.  4 

Opt. 

Mean  Square 
Position  Error 
(meters^) 

0.01003 

0.01433 

0.00911 

Mean  Square 
Velocity  Error 
( meters^ /sec^) 

439.7 

315.0 

719.0 

560.0 

285.5 

Table  5.  Monte  Carlo  Simulation  x(t^)  Results 


■9.J.5  Results.  The  results  of  the  Monte  Carlo  simulations  are  tabulated  in 
Tables  4  and  5.  Table  4  presents  the  average  mean  square  prediction  error  at  t~ 
for  each  simulation  calculated  by 


Mean  Square  Position 
Prediction  Error 

Mean  Square  Velocity 
Prediction  Error 

Mean  Square  Parameter 
Estimation  Error 
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Figure  22.  Parameter  Estimation  Convergence 


Table  5  presents  the  average  mean  square  estimation  error  at  for  each  simulation 
calculated  by 


Mean  Square 
Position  Error 


,  4000  1  1000 


(133) 


Mean  Square  1  1  ..  >,2, 

Velocity  Error  4000  run8=i  1=201 


n  34 ' 


The  simulation  results  are  averages  from  2  seconds  through  10  seconds.  This 
was  done  to  eliminate  the  effects  of  the  initial  identification  transient  from  the  re¬ 
sults.  Figure  22  is  a  plot  of  the  parameter  estimate  versus  the  true  parameter  for 


a  representative  sample  run  from  the  Monte  Carlo  simulation.  The  estimator  con¬ 
verges  to  the  selected  filter  with  =  1  within  the  2  second  window,  and  the  results 
tabulated  above  accurately  reflect  the  steady  state  mean  square  error.  In  a  system 
with  a  varying  parameter,  the  probabilities  must  be  bounded  below,  as  discussed 
in  Section  2. 1.3. 3,  to  prevent  the  adaptation  mechanism  from  experiencing  lockup. 
This  lower  bound  is  discussed  further  in  Section  3. 1.5. 4. 
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The  optimized  estimator  exhibits  average  mean  square  position  and  velocity 
prediction  errors  lower  than  all  other  methods,  while  providing  good  parameter  esti¬ 
mation.  This  indicates  the  benefits  of  the  proposed  discretization  strategy  in  discern¬ 
ing  the  distinctions  between  alternative  models.  The  benefits  are  clearly  seen  when 
the  results  are  compared  to  method  4,  which  is  the  method  currently  used  at  AFIT. 
The  optimal  estimator  produces  a  mean  square  prediction  error  approximately  one- 
third  of  that  produced  by  the  method  4  system.  As  expected,  the  estimator  w’ith 
the  lowest  overall  prediction  error  is  also  the  estimator  with  the  lowest  overall  fiitf ) 
error. 

It  was  expected  that  the  filter  designed  through  optimization  would  exhibit  a 
mean  square  error  19.2%  lower  than  that  of  method  2.  The  simulation  results  show 
that  the  op*^imized  MMFA  has  a  19.1%  lower  mean  square  error.  Moreover,  the  values 
of  error  calculated  in  the  cost  functional  computations  are  a  very  accurate  prediction 
of  the  simulation  results.  Figures  23  and  24  are  plots  of  the  mean  square  prediction 
error  (±lcr)  shown  in  solid  lines  from  the  simulation  of  the  optimized  design.  The 
predicted  mean  square  prediction  error  from  the  cost  functional  calculations  is  shown 
in  dotted  lines.  The  simulation  results  in  a  mean  square  position  prediction  error 
0.73%  higher  on  average  than  wa>;  predicted.  The  mean  square  velocity  prediction 
error  is  0.70%  lower  on  average  than  was  predicted. 

Note  that  the  standard  deviation,  a,  is  small  when  the  true  parameter  value  is 
close  to  a  parameter  in  the  representative  parameter  set,  and  grows  larger  as  the  true 
parameter  value  moves  away  from  the  representative  parameters.  This  is  indicative 
of  the  degradation  in  performance  of  an  estimator  beuced  on  an  incorrect  parameter. 

The  basic  optimization  algorithm  successfully  produced  a  representative  pa¬ 
rameter  set  for  an  MMFA  with  a  prediction  error  lower  than  any  previously  accepted 
method.  The  numbers  are  specific  to  this  example,  and  different  amounts  of  benefit 
are  likely  for  other  problems.  This  example  shows  that  the  basic  design  method 
is  a  valuable  tool,  both  for  predicting  the  performance  of  MMFA  designs,  and  for 
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Frequency  (rad/sec) 


Figure  23.  Predicted  Versus  Simulation  Position  Prediction  Error 


Figure  24.  Predicted  Versus  Simulation  Velocity  Prediction  Error 
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designing  MMFAs  which  provide  better  estimation  performance.  The  next  few  sec¬ 
tions  illustrate  how  the  basic  algorithm  can  be  modified. 


3. 1.5.1  Effect  of  Various  Weighting  Matrices.  The  weighting  matrix 
W  was  included  in  the  development  to  provide  a  means  for  tradeoff  in  systems 
where  minimizing  the  estimation  error  for  one  state  has  a  detrimental  effect  on  the 
estimation  accuracy  of  another  state.  Such  conflict  is  common  in  practical  engineer¬ 
ing  design.  Our  chosen  example  problem  exhibits  a  rather  interesting  phenomenon. 
Chosing  the  weighting  matrices 


Wi 


1  0 
0  0 


and 


Wj 


0  0 
0  1 


produce  the  same  parameter  discretization,  namely 


=  [25.89] 
=  [41.61] 
a^  =  [56.01] 


which  is  the  same  parameter  set  found  in  Section  3.1.2. 
example,  the  position  prediction  error  is  minimized  when 
error  is  minimized. 


It  seems  that,  for  this 
the  velocity  prediction 


3. 1.5. 2  Using  The  ME/I  Hypothesis.  The  large  jumps  in  the  true  state 
estimation  error  autocorrelation,  seen  in  Figures  16  -  20,  inspired  a  search  for  an 
MMFA  that  selects  the  filter  with  the  lowest  estimation  error  autocorrelation.  The 
ME/I  mechanization,  described  in  Section  2. 1.3.1,  causes  the  MMFA  to  choose  the 
filter  with  the  lowest  residual  autocorrelation,  E{r*'^(<i)r*(f,)},  per  Equation  (2). 
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This  hcLS  the  effect  of  shifting  the  switch  point  as  illustrated  in  Figures  25 
and  26.  Figure  25  shows  the  residual  prediction  error  autocorrelation  of  the  basic 
optimized  estimator  of  Section  3.1.3  assuming  the  standard  ME/M  hypothesis  and 
the  ME/I  hypothesis.  The  estir^ator  using  the  ME/I  hypothesis  selects  the  filter  with 
the  lowest  residual  error  autocorrelation.  Figure  26  is  the  corresponding  velocity 
prediction  error  autocorrelation. 

The  cost  functionals  for  the  five  systems  described  in  Section  3.1.2  are  tabulated 
in  Table  6.  In  most  cases  the  costs  are  significantly  smaller  than  the  corresponding 
costs  for  the  basic  system  shown  in  Table  3.  This  would  lead  us  to  believe  that  an 
MMFA  with  the  ME/I  hypothesis  would  perform  better.  By  replacing  Equation  (54) 
in  the  optimization  algorithm  with  Equation  (58),  the  optimization  algorithm  de¬ 
veloped  in  Section  2.1.2  can  be  used  to  optimize  such  a  system.  For  the  example 
problem,  the  algorithm  finds  the  representative  parameter  set 

=  [26.91] 

=  [43.50] 
a'^  =  [57.00] 

The  cost  for  this  system  (see  Table  6)  combines  a  slightly  higher  position  prediction 
error  with  a  lower  velocity  prediction  error  to  produce  an  overall  cost  which  is  slightly 
lower  than  the  cost  of  the  system  found  in  the  basic  optimization.  This  set  is  shifted 
to  the  right  slightly  from  the  basic  optimized  solution,  reflecting  the  lower  predicted 
error  at  intermediate  points  exhibited  in  Figures  25  and  26. 

This  system  was  evaluated  by  running  the  same  type  of  Monte  Carlo  simula¬ 
tion  as  described  in  Section  3.1.4,  but  using  the  ME/I  hypothesis.  The  hypothesis 
conditional  probability  calculation  was  changed  to  use  Equation  (57)  to  calculate  the 
conditional  density.  The  results,  tabulated  in  Tables  7  and  8,  illustrate  that,  for  each 
estimator,  the  ME/I  mechanization  provides  a  lower  prediction  error  autocorrelation 
on  average. 


74 


Design 

]2 

Method 

(m^) 

(m'^/s'^) 

Method  1 

0.1297 

+ 

1579 

Method  2 

0.0831 

+ 

942 

Method  3 

0.2353 

+ 

3274 

Method  4 

0.1839 

+ 

2497 

Basic  Opt. 

0.0739 

+ 

783 

ME/I  Opt. 

0.0743 

+ 

777 

Table  6.  Costs  for  ME/I  Mechanization 


Quantity  of 
Interest 

Design  Method 

Meth.  1 

Meth.  4 

Basic  Opt. 

ME/I  Opt. 

.MS  Position 
Pred.  Error 
(meters^) 

0.1240 

0.0805 

0.2277 

0.1758 

0.0702 

0.0710 

MS  Velocity 
Pred.  Error 
( meters^ /sec^) 

1520 

920 

3163 

2375 

756 

746 

MS  Para. 

Est.  Error 
(rad^/sec^) 

111.5 

_ 

32.9 

169.5 

502.8 

58.3 

65.2 

Table  7.  ME/I  Hypothesis  Simulation  x(t,  )  Results 


Quantity  of 
Interest 

Design  Method 

Meth.  1 

Meth.  2 

Meth.  3 

Meth.  4 

Basic  Opt. 

ME/I  Opt. 

.MS  Position 
Error 
(meters^) 

0.01075 

0.00908 

0.01574 

0.01026 

0.00902 

0.00910 

MS  Velocity 
Error 

( meters^ /sec^) 

420  7 

306.3 

648.9 

560.0 

280.0 

280.1 

Table  8.  ME/I  Hypothesis  Simulation  x{tf)  Results 
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Table  9.  Optimal  Parameter  Sets  for  Hybrid  Cost  Functional 

The  simulation  results  for  the  optimized  ME/I  system  at  t~  support  the  pre¬ 
diction  made  in  the  preliminary  analysis.  The  position  prediction  error  is  slightly 
higher  than  that  for  the  system  found  with  the  basic  optimization,  whereas  the 
velocity  prediction  error  is  lower  than  that  for  the  system  found  with  the  basic  op¬ 
timization.  These  values  combined  produce  a  lower  overall  result.  The  results  at  if 
show  that  the  ME/I  optimized  system  has  a  higher  overall  error,  but  the  difference 
is  on  the  order  of  the  accuracy  of  the  simulation;  hence  no  conclusion  can  be  drawn 
from  these  results. 

The  ME/I  hypothesis  is  an  ad  hoc  modification  to  the  basic  algorithm.  The 
benefit  to  the  example  problem,  though  small,  is  measurable  and  could  possibly  be 
large  in  a  system  where  AT  varies  a  great  deal  across  the  parameter  set. 

3. 1.5.3  Adding  a  Maximum-Type  Cost.  In  Section  2. 1.3. 2,  the  following 
hybrid  cost  functional  was  proposed  in  order  to  address  maximum  mean  square  error 
specifications; 

J  =  AP-f-(l  -  A)J°°  (135) 

where  A  is  allowed  to  vary  between  a  small  e  and  1. 

Table  9  is  a  summary  of  the  results  of  optimizing  with  this  hybrid  cost  func¬ 
tional  using  different  values  for  A.  With  A  equal  to  one,  the  optimization  is  the 
basic  optimization  problem  with  J^.  As  A  is  decreased,  more  emphasis  is  placed 
on  the  maxim.um  estimation  error  autocorrelation,  forcing  the  largest  representative 


m 


nr/s^ 


m 


m^/s^ 


J  =  AJ^  -f  (1  -  A)J^ 


m 


m^/s^ 


1.0 


O.i 


25.89 


41.61 


28.53 


44.35 


58.40 


0.0742 


797.1 


0.2394 


3575 


0.0742 


0.0760 


805.3 


0.1746 


2533 


0.1056 


797.1 


1324 


0.3 


28.53 


46.50 


58.98 


0.0778 


827.2 


0.1633 


2348 


0.137’ 


1892 


0.1 


31.28 


47.06 


.59.27 


0.0799 


842.5 
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Figure  27.  Prediction  Error  Autocorrelation  for  A  =  0.3 


parameter  value  to  increase  (see  Figure  27  as  compared  to  Figure  20).  This  lowers 
the  maximum  estimation  error  autocorrelation  (i.e.,  peak  value  of  the  curve)  at  the 
expense  of  increasing  the  average  estimation  error  autocorrelation  (i.e.,  area  under 
the  curve).  There  is  a  limit  to  the  benefit  derived  from  adding  a  maximum-type 
cost.  The  value  of  will  never  go  below  0.1294m^  -I-  1776m^/s^,  which  is  the  value 
achieved  by  the  true  optimal  solution  which  requires  an  uncouiitably  infinite  bank 
of  filters. 


3. 1.5.4  Effect  of  Estimator  Lower  Bounds.  We  assumed,  in  the  previ¬ 
ous  sections  of  this  chapter,  that  the  probability  of  the  selected  filter  converges  to 
one.  Once  this  happens,  the  probabilities  of  the  other  filters  are  zero  and  the  prob¬ 
ability  calculation  will  experience  lockup,  as  explained  Section  2. 1.3.3.  This  is  why 
the  true  natural  frequency  was  constant  in  the  simulations  of  Section  3.1.4. 

Using  the  equations  developed  in  Section  2. 1.3. 3,  it  is  possible  to  calculate  the 
cost  of  an  MMFA  for  any  given  lower  bound  on  the  probabilities.  Table  10  shows 
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Table  10.  Cost  Variation  with  Different  Lower  Bounds  on  the  Probabilities 


Quantity  of 
Interest 


0.000 


Pmin 


0.001  0.010  0.100 


MS  Position 
Prediction  Error 
(meters^) 


MS  Velocity 
Prediction  Error 
( meters^ /sec^) 


MS  Parameter 
Estimation  Error 
(rad^/sec^) 


MS  Position 
Error 
(meters^) 


MS  Velocity 
Error 

( meters^ /sec^) 


0.06470 

0.08325 

718.3 

938.6 

62.1 

105.7 

0.00899 


283.5  287.9  382.7 


Table  11.  Simulation  Results  with  Various  Values  of  p^, 


how  the  cost  of  the  basic  optimized  solution  varies  with  a  change  in  the  probability 
lower  bound.  .\s  pmin  is  increased,  the  cost  increases  as  expected. 

The  results  of  simulations  with  various  values  of  are  shown  in  Table  11. 
It  is  evident  from  these  results  that  adding  a  lower  bound  to  the  probabilities  has 
an  additional  benefit  v.’hicli  cannot  be  predicted  from  the  cost  calculations.  It  can 
be  explained,  however,  with  the  help  of  Figures  28  and  29  which  are  plots  of  the 
parameter  estimate  in  a  sample  simulation  with  Pmjn  =  0.001  and  pmjn  =  0.010. 

Ii  the  basic  simulations,  the  MMFA  converges  to  one  of  the  filters  in  the 
bank  and  stays  there.  Since  there  are  a  finite  number  of  filters  and  an  unconntably 
infinite  number  of  possible  true  parameters,  the  selected  filter  in  the  MMFA  is  almost 
surely  based  upon  the  wrong  parameter.  Bounding  the  probabilities  below  allows 
the  MMFA  to  move  off  the  wrong  filter  if  the  dynamics  in  the  system  match  a 
different  filter  at  some  point  during  the  simulation.  The  most  benefit  is  derived  at 
the  points  in  the  parameter  space  where  the  selection  switches  filters.  Figures  30 
and  31  are  plots  of  Monte  Carlo  simulation  results  with  pnun  =  0.001  using  the  basic 
optimization  design.  Note  how  the  mean  square  error  from  the  simulation  is  lower 
around  the  switch  points  of  the  MMFA,  as  compared  to  the  results  displayed  in 
Figures  23  and  24.  This  benefit  is  large  and  masks  the  detrimental  effect  computed 
above  for  small  values  of  pmin-  However,  for  larger  values  of  Pmin,  the  detrimental 
effect  becomes  dominant  cis  illustrated  by  the  results  in  Table  11.  The  amount  of 
benefit  gained  is  likely  to  be  problem  dependent. 

3.1.6  Computational  Burden.  The  computations  required  for  this  research 
were  accomplished  on  an  Elxsi  computer  with  10  central  processing  tmits  (CPUs). 
Each  CPU  is  rated  at  8  VAX  MIPS,  but  the  overhead  due  to  the  multiple  user 
configuration  effectively  lowers  this  rating.  The  cost  functional  calculations  for  the 
example  problem  takes  approximately  six  minutes  of  CPU  time  with  the  parameter 
set  divided  into  200  intervals.  The  majority  of  the  computation  time  is  spent  on 
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Natural  fraqu.ncy  (rad/.ae)  Natural  Frequency  (rad/.ec) 


«  xlO* 


Tima  (eae) 

Figure  28.  Estimator  Convergence  with  Pmin  =  0.001 


Time  (eae) 

Figure  29.  Estimator  Convergence  with  p„un  =  0.010 


solving  the  Lyapunov  equation,  Equation  (47),  for  each  of  the  three  filters,  at  each 
point  evaluated  on  the  curve.  The  software  used  in  this  research  solves  the  Lyapunov 
equations  by  computing  Equation  (47)  for  a  sufficient  number  of  steps  to  reach  the 
desired  level  of  accuracy.  A  good  Lyapunov  equation  solving  routine  would  make 
a  dramatic  impact  on  the  amount  of  computing  power  necessary  for  any  problem. 
With  a  lower  bound  on  the  probabilities,  the  problem  changes  from  solving  three  four¬ 
dimensional  Lyapunov  equations  to  solving  one  eight-dimensional  Lyapunov  equation 
at  each  point  evaluated,  and  the  cost  functional  calculation  takes  approximately  12 
minutes  of  CPU  time. 

The  estimator  optimization  algorithm  is  more  computationally  intensive,  as 
it  involves  calculating  the  cost  functional  at  each  step  in  the  minimization.  Equa¬ 
tion  (47)  must  be  solved  for  each  filter,  at  each  point  evaluated  on  the  curve,  for 
each  cost  functional  evaluation  during  the  optimization.  As  an  example,  if  200  points 
are  evaluated  on  the  curve  for  each  cost  functional  evaluation,  and  the  optimization 
algorithm  takes  40  evaluations,  3  x  200  x  40  =  24,000  Lyapunov  equations  must  be 
solved.  A  representative  optimization  for  the  basic  example  problem  takes  approxi¬ 
mately  3.5  hours  of  CPU  time. 

3.1.7  Summary.  Using  a  vector  minimization  routine  and  an  appropriate 
cost  functional,  a  multiple  model  filtering  algorithm  (MMFA)  was  designed  which 
provides  the  minimum  mean  square  prediction  error  (at  t~),  when  compared  with 
other  previously  accepted  design  methods.  This  is  important  because  many  control 
strategies  depend  upon  the  ability  to  provide  good  prediction  of  future  behavior 
of  the  plant  before  the  next  sampled  data  measurement.  As  anticipated,  this  sys¬ 
tem  also  provides  lower  mean  square  estimati  .  er’-or  (at  tf)  than  any  previously 
accepted  method. 

This  system  was  shown  in  simulation  to  provide  results  for  x{t~ )  very  close  to 
values  predicted  in  the  design  method.  This  makes  the  cost  functional  computation 
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a  very  useful  analysis  tool  in  as  much  ris  it  accurately  predicts  the  performance  of  the 
MMFA  across  the  parameter  space.  Computing  the  cost  functional,  and  specifically 
producing  plots  like  Figures  16  through  20,  provides  a  good  measure  of  the  quality 
of  any  given  design.  Based  on  this  information,  one  can  decide  whether  or  not  an 
optimization  is  warranted. 

Replacing  the  ME/M  hypothesis  with  the  ME/I  hypothesis  allowed  the  MMFA 
to  converge  to  the  filter  with  the  lowest  residual  autocorrelation.  This  resulted  in 
better  performance  for  all  of  the  previously  accepted  designs.  The  optimization 
algorithm  was  modified  to  implement  this  hypothesis  and  produced  a  design  different 
from  the  basic  optimized  design.  This  ME/I  optimized  design  provides  the  lowest 
mean  square  prediction  error,  although  no  substantive  benefit  was  seen  in  the 
results. 

It  was  shown  in  Section  3. 1.5. 3  that  a  maximum-type  cost  can  be  added  to  the 
original  algorithm  to  find  systems  with  a  lower  maximum  error  autocorrelation  than 
the  system  found  with  the  basic  approach.  This  decrease  in  maximum  mean  square 
prediction  error  is  found  at  the  expense  of  increased  average  mean  square  prediction 
error. 

The  ad  hoc  method  of  placing  a  lower  bound  on  the  probabilities  allows  the 
system  to  adapt  to  changing  parameters.  The  calculations  required  for  the  optimiza¬ 
tion  algorithm  can  be  augmented  to  take  this  into  account,  and  they  predict  that  the 
lov.er  bound  will  have  a  detrimental  effect  on  the  estimation  performance.  In  simu¬ 
lation,  however,  the  lower  bound  is  also  shown  to  have  a  beneficial  effect  by  allowing 
the  MMFA  to  move  off  of  a  mismatched  filter  after  the  MMFA  hcis  converged  to  that 
filter.  At  low  values  of  Pmitu  this  beneficial  effect  masks  any  detrimental  effects. 

S.2  Parameter  Estimator  Evaluation 

It  is  important  to  note  that  the  optimal  state  estimator  constructed  in  Sec¬ 
tion  3.1.2  is  not  necessarily  the  best  parameter  estimator.  The  MMFA  constructed 
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Design 

Method 

32 

(rad'^/s^) 

Method  1 

149.0 

Method  2 

40.1 

Method  3 

280.2 

Method  4 

494.8 

Basic  Opt. 

58.6 

Pai  Opt. 

37.0 

Table  12.  Parameter  Estimation  Costs 

using  method  2  has  the  lowest  mean  square  parameter  estimation  error  of  the  five 
methods  tested.  This  leads  us  to  the  conclusion  that  the  best  state  estimator  is  not 
necessarily  the  best  parameter  estimator.  By  using  the  algorithm  in  Section  2.2.1, 
a  parameter  estimator  is  found  which  is  optimal  in  the  sense  that  it  minimizes  the 
cost  functional  of  Equation  (74).  The  set  of  representative  parameters  found  for  the 
example  problem  is: 


=  [18.51] 

=  [35.30] 
a^  =  [53.09] 

This  system  is  spread  more  evenly  in  the  parameter  space  than  the  optimized  state 
estimator.  However,  it  is  not  quite  the  same  as  the  method  2  design  because  its 
switch  points  are  based  on  the  state  estimation  performance  instead  of  geometry  in 
the  parameter  space.  As  shown  in  Table  12,  a  system  designed  with  this  parameter 
set  has  a  lower  cost  than  all  other  methods. 

A  three-filter  MMFA  based  upon  this  parameter  set  was  simulated  in  the  same 
manner  as  the  other  estimators  in  Section  3.1.4.  The  results  are  tabulated  in  Table  13 
along  with  the  results  from  the  previous  tests.  Note  that  the  optimization  produces 
a  parameter  estimator  with  the  lowest  mean  square  parameter  estimation  error.  Also 
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Quantity  of 
Interest 

Design  Method 

Meth.  1 

Meth.  2 

Meth.  3 

Meth.  4 

Basic  Opt. 

Para.  Opt. 

MS  Para. 
Est.  Error 
(rad^/sec^) 

138.8 

39.4 

265.3 

495.8 

_ 1 

59.6 

37.5 

Table  13.  Parameter  Estimation  Results 

note  that  there  is  good  agreement  between  the  predictions  in  Table  12  and  the  results 
in  Table  13. 


3.2.1  Effect  of  Various  Weighting  Matrices.  The  example  problem  used  in 
this  document  has  one  variable,  so  the  presence  of  a  weighting  matrix  in  the  cost 
functional  is  inconsequential.  Kcwever,  the  weighting  matrix  might  be  useful  in  other 
problems  where  it  is  desirable  to  weight  one  parameter  more  than  another. 

It  may  be  necessary  to  use  a  weighting  matrix  to  give  parameters  equal  weight 
if  they  have  different  scales.  Let’s  cissume  that  the  damping  ratio  was  allowed  to 
vary  along  with  the  natural  frequency  in  the  example  problem.  The  damping  ratio 
can  vary  in  the  interval  [0,  1].  The  natural  frequency  can  vary  in  the  interval  [6.28 
rad/sec,  62.83  rad/sec].  Using  a  weighting  matrix  of 


W  = 


1  0 
0  (56.55)2 


[  c  J 

would  put  the  parameters  on  more  equal  terms  in  the  minimization,  in  the  sense  that 
the  same  weight  is  given  to  equal  percentage  variations  of  the  admissible  parameter 
ranges. 
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Design 

Method 

(radVs'^) 

Method  1 

119.28 

Method  2 

33.96 

Method  3 

174.04 

Method  4 

493.45 

Basic  Para.  Opt. 

33.52 

ME/I  Para.  Opt. 

33.24 

Table  14.  Costs  for  Parameter  Estimators  Using  the  ME/I  Mechanization 

3.2.2  Using  the  ME/I  Hypothesis  in  the  Parameter  Estimator.  The  ME/I 
hypothesis,  as  described  in  Section  3. 1.5.2,  may  be  implemented  in  a  parameter 
estimator.  There  is  no  change  to  the  structure  of  the  parameter  estimation  cost 
functional.  The  only  change  to  the  optimization  algorithm  is  the  use  of  Equa¬ 
tion  (58)  in  place  of  Equation  (54).  Assuming  the  ME/I  hypothesis  and  running 
the  optimization  algorithm  of  Section  2.2.1  produces  the  representative  parameter 
set 

a'  =  [17.88] 

=  [35.33] 
a^  =  [53.14] 

The  cost  for  a  system  based  on  this  parameter  set  is  shown  in  Table  14  along  with 
the  corresponding  costs  for  the  other  applicable  systems. 

This  system  was  evaluated  by  running  the  same  type  of  Monte  Carlo  simulation 
as  described  in  Section  3.1.4  using  the  ME/I  hypothesis.  The  results,  tabulated  in 
Table  15,  illustrate  that  optimizing  for  the  ME/I  mechanization  provides  a  parameter 
estimator  with  lower  mean  square  parameter  estimation  error  on  average  than  other 
methods  using  the  ME/I  mechanization.  The  simulation  results  for  the  optimized 
ME/I  system  support  the  prediction  made  in  the  preliminary  analysis,  as  there  is 
good  agreement  between  the  predictions  in  Table  14  and  the  results  in  Table  15. 
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Quantity  of 
Interest 

Design  Method 

Meth.  1 

Meth.  2 

Meth.  3 

.,,1 

Meth.  4 

Para.  Opt. 

ME/I  Opt. 

MS  Para. 
Est.  Error 
(rad^/sec^) 

111.52 

1 

32.94 

_ 1 

169.53 

_ 1 

.502.75 

.. 

33.34 

32.92 

Table  15.  ME/I  Hypothesis  Simulation  Results 

Note  that  the  ME/I  optimized  system  implemented  with  the  ME/I  mechaniza¬ 
tion  exhibits  lower  mean  square  parameter  estimation  error  than  the  best  results 
using  the  ME/M  mechanization  (compare  Table  15  to  Tame  13).  Recall  though, 
that  the  ME/I  hypothesis  is  an  ad  hoc  modification  to  the  basic  algorithm.  The 
benefit  to  the  example  problem,  though  small,  is  measurable  and  could  possibly  be 
large  in  a  system  where  M  varies  a  great  deal  across  the  parameter  set. 

3.2.3  Effect  of  Probability  Lower  Bounds  on  the  Parameter  Estimator.  The 
simulations  performed  for  the  previous  section  allowed  the  probabilities  to  go  to 
zero,  to  make  the  simulation  match  the  prediction  as  closely  as  possible.  Figure  32 
illustrates  how  the  parameter  converges  to  an  estimate  and  stays  there.  The  estimate 
is  the  parameter  on  which  the  selected  filter  is  based.  In  practice,  one  would  place  a 
lower  bound  on  the  probability,  as  explained  in  Section  2.2.2,  allowing  the  MMFA  to 
adapt  to  change  in  tiie  tru®  parameter.  Using  the  expression  in  Equation  (78)  in  the 
parameter  estimator  cost  functional  allows  us  to  compute  the  cost  for  any  given  lower 
bound  on  the  probabilities.  Table  16  shows  how  the  cost  of  the  parameter-optimized 
solution  varies  with  a  change  in  the  probability  lower  bound. 

The  results  of  simulations  with  various  values  of  pmin  are  shown  in  Table  17.  As 
was  the  case  with  the  state  estimator  example,  there  is  a  benefit  gained  by  placing 
lower  bounds  on  the  probability.  This  benefit,  described  in  Section  3. 1.5. 4,  cannot 
be  calculated  beforehand,  and  outweighs  any  calculable  effects  for  small  values  of 
Pmin-  At  larger  values  of  Pmirn  the  calculable  effects  become  dominant,  similar  to  the 
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Esl'atate 


Tima  (aae) 

Figure  32.  Example  Parameter  Estimation,  pmi 


Probability 

Lower  Bound 

(rad'^/sec^) 

0.000 

37.04 

0.001 

37.12 

0.005 

37.51 

0.010 

38.07 

0.050 

45.84 

0.100 

63.75 

Table  16.  Cost  Variation  with  Different  Lower  Bounds  on  the  Probabilities 


Quantity  of 
Interest 


MS  Parameter 
Estimation  Error 
(rad^/sec^) 


Table  17.  Bayesian  Simulation  Results  with  Various  Values  of  p 


0.000 

0.001 

0.010 

0.100 

37.48 

35.92 

37.57 

69.85 
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state  estimator  results  (refer  to  Table  11). 

S.2.4  Summary.  It  is  important  to  note  that  the  optimal  state  estimator 
constructed  in  Section  3.1.2  is  not  necessarily  the  best  parameter  estimator.  Indeed, 
for  this  example,  an  MMFA  was  found  which  provided  parameter  estimation  with 
lower  average  mean  square  error  than  any  previously  accepted  method.  This  was 
accomplished  by  minimizing  a  cost  functional  which  represented  the  mean  square  pa¬ 
rameter  estimation  error  averaged  over  the  admissible  parameter  set.  Sections  3.2,1 
through  3.2.3  illustrate  some  of  the  modifications  which  can  be  made  to  the  basic 
.MMFA  and  the  corresponding  modifications  to  the  optimization  algorithm. 

Using  the  ME/ 1  hypothesis  and  placing  lower  bounds  on  the  probabilities  both 
provided  benefits  in  the  form  of  lower  mean  square  parameter  estimation  error. 
The  system  optimized  for  the  ME/I  hypothesis  provides  the  lowest  average  mean 
square  parameter  estimation  error.  The  benefit  from  placing  lower  bounds  on  the 
probabilities  is  not  calculable,  and  outweighs  the  detrimental  effect  calculated  in  the 
cost  functional  for  lower  values  of  Pmin  which  are  likely  to  be  used  in  an  application. 

3.3  .M.MCA  Evaluation 

In  this  section,  the  controller  optimization  algorithm  is  used  to  design  an 
.M.MC.A  for  the  ideal  mechanical  translational  system  of  Section  3.1. 

3.3.  t  A  Position  Regulator.  The  objective  is  to  design  a  position  regulator 
for  a  system  with  the  following  state  equations: 

x*(t,+J  =  ^*(1,+,,  t,)x*(t,)  +  J3’(<,)u*(t,)  -f  G*(<,)w;(<,)  (136) 

y*(<.)  =  C*(t.)x*(t.)  (137) 

z*(<.)  =  H*(/.)x*(t.)  +  v*(t.)  (138) 

The  output  matrix  C*  is  set  to  [1  0]  to  specify  position  a.s  the  desired  output.  ,-Ml 
other  matrices  are  the  same  cls  they  are  in  the  estimation  problem  of  Section  3.1. 


90 


Figure  33.  Position  Regulator 

Figure  33  is  a  block  diagram  of  an  MMCA  used  as  a  regulator  where  the  objective 


is  to  control  the  output  vector,  y'iti)  =  to  zero.  The  cost  functional  for  this 


system  is  written  as  follows: 

,  _  nE(y'^wy)da 

‘  /^da 

E{xiWxi}da 


(139) 

(140) 


3.3.2  Controller  Selection  Process.  The  filters  in  the  bank  of  controllers  are 
steady-state  Kalman  filters  as  used  in  the  estimator  problem.  The  controller  gains, 
G^,  are  calculated  using  a  sampled  data  Linear  Quadratic  Gaussian  (LQG)  stochastic 
regulator  design  method  as  explained  in  [37,  Sect.  14.2).  The  gains  are  calculated 
to  minimize  a  continuous  time  quadratic  cost  functional  of  the  form 


J  —  f  [x '  (QSx(f) -h  u^(t)Uu(f)]df  (141) 

Jo 

.Minimizing  the  mean  square  position  autocorrelation  corresponds  to  weighting  ma¬ 
trices  of 


(142) 
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and 

U  =[0]  (143) 

Tliese  matrices  place  all  of  the  design  emphasis  on  the  position  of  the  system,  re¬ 
sulting  in  high  gains  on  the  state  feedback  without  regard  to  the  amount  of  control 
energy  needed. 

Using  these  weighting  matrices  and  W  =  [l],  the  controller  design  algorithm 
described  in  Section  2.3.2  finds  the  representative  parameter  set 

=  [25.89] 

=  [41.61] 
a^  =  [56.01] 

which  is  the  same  parameter  set  found  for  the  state  estimation  problem.  One  might 
presume  that,  to  have  the  best  controller  performance,  it  is  more  important  to  have 
accurate  state  estimates  than  parameter  estimates.  This  is  consistent  with  the  idea 
that  a  system  with  full  state  feedback  is  more  robust  than  a  system  which  uses  state 
estimates  for  feedback.  Stability  (and  performance)  can  be  well  preserved  even  with 
controller  gains,  G*,  evaluated  on  the  basis  of  incorrect  parameter  values,  as  long 
as  the  state  values  are  well  estimated.  Therefore,  accuracy  in  the  state  estimates  is 
more  critical  than  precision  of  the  parameters  needed  for  gain  computation. 

3.3.3  Preliminary  Analysis.  A  preliminary  analysis  was  accomplished  by  cal¬ 
culating  the  cost  functional,  of  Equation  (140),  for  each  choice  of  parameters.  As 
shown  in  Table  18,  the  representative  parameter  set  found  by  the  controller  opti¬ 
mization  algorithm  has  the  lowest  cost.  Plots  of  the  steady  state  position  error  auto¬ 
correlation  are  shown  in  Figures  34  -  38  as  a  function  of  the  truth  system  parameter 
for  each  of  the  design  methods.  Triangles  on  the  frequency  axis  denote  elemental 
controller  parameter  values.  The  cost  functional  was  computed  by  integrating  these 
plots. 
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rrelatian  (meter3)«*2 


Design 

Method 

(m") 

Method  1 

0.06461 

Method  2 

0.05530 

Method  3 

0.07993 

Method  4 

0.07289 

Optimization 

0.05076 

Table  18.  Controller  Design  Costs 


Figure  34.  Steady  Stare  Controller  Position  Autocorrelation  Using  Method  1 
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Frequency 


Figure  35.  Steady  State  Controller  Pos 


Frequenc! 

P’lgure  36.  Steady  State  Controller  Pos 


9. 


The  points  at  which  each  MMCA  switches  controllers  is  determined  by  the 
underlying  MMFA  structure,  so  the  discontiiiuiti:,c  in  these  plots  are  at  the  same 
points  as  the  discontinuities  in  the  plots  of  the  estimator  autocorrelations,  Figures  16 
through  20.  For  any  given  controller,  the  curve  is  relatively  flat  at  frequencies  below 
the  design  frequency  and  increases  rapidly  at  frequencies  above  the  design  frequency. 

3.3.4  Controller  Simulation.  To  evaluate  the  parameter  discretization  found 
by  the  optimization  algorithm,  Monte  Carlo  simulations  were  performed,  similar 
to  the  simulations  of  Section  3.1.4.  A  block  diagram  of  the  simulation  is  shown 
in  Figure  39.  Though  our  primary  interest  was  in  x*,  the  state  and  parameter 
estimates  were  also  available  for  analysis,  as  were  the  commanded  control  values. 
Equation  (127)  was  used  to  simulate  the  truth  system  with  Un  constant  throughout 
the  simulation.  Each  MMCA  design  was  tested  by  running  20,  ten-second  simulations 
at  each  of  200  values  of  (a*).  The  probabilities  were  propagated  by  Equation  (1) 
from  an  a  priori  probability  distribution  of  p‘(<o)  =  P^{to)  =  P^(<o)  =  1/3,  and  there 
was  no  lower  bound  on  the  probabilities.  As  was  the  case  with  previous  simulations, 
the  truth  model  was  the  same  structure  as  the  filter  models,  and  the  only  difference 
was  the  value  of  u;„,  so  the  transformation  matrix  T  wa^  the  identity  matrix. 

3.3.5  Results.  The  results  of  the  Monte  Carlo  simulations  are  tabulated  in 
Table  19.  The  values  shown  are  the  average  mean  square  position  of  the  system 
calculated  by 

j  4000  j  1000 

Mean  Square  Position  = -  5^  { -  ^  (144) 

4000  800  ^  ^  ^ 


The  results  of  the  simulation  were  not  the  results  expected  from  the  preliminary 
analysis  shown  in  Table  18.  The  systems  resulting  from  methods  1  and  3  were 
calculated  to  be  stable  across  the  entire  frequency  range  (see  Table  18).  As  it  turns 
out,  they  are  only  marginally  stable  at  certain  points  in  the  frequency  range,  and  they 
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Figure  39.  MMCA  Simulation  Block  Diagram 


Quantity  of  _ Design  Method 


Interest 

Method  1 

Method  2 

Method  3 

Method  4 

Optimization 

MS  Position 
(meters^) 

unstable 

0.05543 

unstable 

0.07312 

0.05416 

Table  19.  Monte  Carlo  Controller  Simulation  Results 
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Figure  40.  Optimized  MMCA  Simulation  Results,  Position  Regulation  Error 

could  not  maintain  stability  during  the  simulations.  The  results  of  the  simulation  of 
the  optimized  system  are  shown  in  Figure  40.  The  system  performs  to  within  7%  of 
the  m.ean  square  regulation  error  predicted  in  the  preliminary  analysis;  however,  the 
cost  of  the  method  2  system  is  within  9%  of  that  of  the  optimized  system  cost.  Since 
the  difference  between  the  costs  of  the  method  2  and  optimized  systems  is  on  the 
order  of  the  accuracy  of  the  simulation,  the  benefit  of  optimization  is  not  verified. 

By  observing  the  process  on  a  system  with  more  dynamic  driving  noise,  the 
benefits  of  optimization  become  apparent.  To  illustrate  this  point,  the  example 
problem  was  changed  slightly  and  the  strength,  Q,  of  the  dynamic  driving  noise  was 
increased  two  orders  of  magnitude  so  that: 

+  r)}  =  QS{t)  =  1.0  S{t) 

This  increased  energy  in  the  system  provides  for  much  more  cost  differential  between 
systems,  and  hence,  more  dramatic  results.  Since  this  change  affects  the  filter  gains 
and  the  residual  characteristics,  it  is  necessary  to  run  the  optimization  algorithm  for 
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Design 

Method 

(m-^) 

Method  1 

unstable 

Method  2 

unstable 

Method  3 

unstable 

Method  4 

4.344 

Optimization 

2.894 

Table  20.  Controller  Design  Costs  with  Q  =  1.0 

this  new  problem.  The  optimization  algorithm  produces  the  system 

a*  =  [30.64] 

=  [43.85] 
a^  =  [56.39] 

The  costs  computed  for  this  system  and  the  systems  found  with  the  other 
methods  are  shown  in  Table  20.  The  primary  impact  of  the  increased  dynamic  driving 
noise  was  to  move  a^  and  a^  to  the  right.  With  the  higher  dynamic  driving  noise,  the 
elemental  controllers  cannot  maintain  stability  if  the  true  natural  frequency  is  much 
higher  than  the  frequency  assumed  within  the  controller.  The  algorithm  naturally 
compensates  for  this.  This  phenomenon  is  also  the  cause  for  method  2  to  be  unstable 
with  this  higher  value  of  Q  when  its  results  are  close  to  the  optimized  results  at  the 
lower  value  of  Q  (refer  to  Table  19). 

The  systems  found  using  methods  1  through  3  are  unstable  at  some  point  in  the 
expected  frequency  range.  A  designer  using  one  of  these  methods  would  assume  that 
more  controllers  were  needed  to  maintain  control  over  the  entire  frequency  range. 
This  is  not  the  case,  as  the  optimization  algorithm  found  a  three  controller  system 
that  maintains  stability  over  the  entire  parameter  set. 

The  MMCA  is  unstable  when  the  eigenvalues  of  the  Y  matrix  of  Equation  (103), 
are  outside  the  unit  circle  at  some  point  in  the  parameter  set.  In  order  to  keep  the 
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Quantity  of 
Interest 

Design  Method 

Method  1 

Method  2 

Method  3 

Method  4 

Optimization 

MS  Position 
(meters^) 

un.stable 

unstable 

unstable 

4.366 

2.934 

Table  21.  Monte  Carlo  Controller  Simulation  Results  with  Q  =  1.0 


Figure  41.  MMCA  Simulation  Results  with  Q  =  1.0.,  Position  Regulation  Error 

optimization  algorithm  viable  when  it  evaluated  an  unstable  MMCA,  the  cost  func¬ 
tional  was  computed  as  if  the  unstable  region  was  not  in  the  admissible  parameter 
set,  and  a  penalty  function  was  added  proportional  to  the  size  of  the  unstable  region. 
This  allowed  the  optimization  algorithm  to  move  toward  an  acceptable  solution. 

All  five  systems  were  simulated  in  the  same  manner  cis  presented  previously, 
and  the  results  of  the  simulations  are  presented  in  Table  21.  The  system  found  by 
the  optimization  algorithm  performs  as  predicted,  and  this  is  shown  in  Figure  41. 
The  simulation  results  are  within  1.4%  of  the  predicted  mean  square  regulation  error. 
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3.3.5. 1  Effect  of  The  Weighting  Matrix  on  Controller  Performance.  In 
Section  3.3.2,  a  position  regulator  wa^  designed  by  placing  all  emphasis  on  the  po¬ 
sition  state.  If  the  velocity  is  also  of  interest,  the  output  matrix  can  be  changed 
to 


in  Equation  (137).  This  opens  the  door  to  the  use  of  various  weighting  matrices.  As 
an  example,  suppose  we  keep  Q  =  l.Q  and  choose 

1  0 

W=  (14.5) 

0  0.01  J 

This  matrix  places  a  small  additional  weight  on  the  velocity. 

The  controller  gains  are  calculated  to  minimize  a  continuous  time  quadratic 
cost  functional  of  the  form 

J  =  {  (x^(t)Sx(t) -f  u'*'(t)Uu(<)]d<  (146) 

Jo 

To  be  consistent  with  the  choice  of  W,  we  choose 

1  0 

S=  (147) 

0  0.01 

and 

U  =  [0]  (148) 

Note  that  this  choice  produces  different  gain  matrices,  G^,  than  the  choice  of  S  and 
U  used  previously. 

Using  these  weighting  matrices,  the  controller  design  algorithm  described  in 
Section  2.3.2  finds  the  representative  parameter  set 


a‘  =  [25.24] 
=  [41.11] 
a^  =  [62.56] 
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Quantity  of 
Interest 

Design  Weight 

W  =  diag[l. 0,0.0] 

W  =  diag[l.0,0.0l] 

MS  Position 
(meters^) 

2.93 

27.49 

MS  Velocity 
(meters^/sec^) 

273793 

46303 

Table  22.  Monte  Carlo  Controller  Simulation  Results 

Note  that  these  parameters  are  spread  apart  farther  than  the  parameters  for  the 
position  regulator. 

This  system  wa^  simulated  in  the  same  manner  as  the  simulations  in  Section 
3.3.4.  The  results,  tabulated  in  Table  22,  show  that  this  system  has  a  lower  mean 
square  velocity  regulation  error  at  the  expense  of  a  higher  mean  square  position 
regulation  error  as  expected. 

3.3.5.S  Using  the  ME/I  Hypothesis  in  an  MMCA.  The  solution  to  the 
basic  control  problem  suggests  that  the  best  control  results  from  getting  the  best 
state  estimates.  To  improve  the  state  estimation,  we  used  the  ME/I  hypothesis 
which  chose  the  filter  with  the  smallest  prediction  error  covariance.  It  would  be  a 
normal  assumption  that  the  controller  performance  could  be  improved  by  using  the 
ME/I  hypothesis.  It  would  also  be  wrong  for  the  problem  at  hand.  Figures  42  and 
43  show  why  this  is  so. 

In  the  estimation  problem,  with  Q  —  1.0,  it  would  be  desirable  to  shift  the 
switch  points  to  the  right  so  that  the  filter  with  the  lowest  prediction  error  autocor¬ 
relation  is  chosen  (see  Figure  42).  In  the  control  problem,  with  Q  =  1.0,  the  switch 
points  corresponds  to  points  where  the  position  regulation  error  autocorrelation  is 
approximately  the  same  for  both  filters  involved  (see  Figure  43).  Shifting  the  switch 
points  to  the  right  would  force  the  MMCA  to  select  a  controller  with  a  higher  position 
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regulation  error  autocorrelation,  thus  resulting  in  a  larger  cost.  Calculating  the  cost 
of  the  control  system  selected  in  Section  3.3.2  shows  that  we  could  expect  it  to  go 
unstable  using  the  ME/I  hypothesis.  Even  optimizing  for  the  ME/I  hypothesis,  we 
cannot  find  a  system  with  a  lower  cost  than  that  found  with  the  basic  optimization 
for  this  problem. 

The  ME/I  hypothesis  is  an  ad  hoc  addition  to  the  basic  MMCA  structure.  It 
is  shown  to  be  detrimental  in  the  controller  example  with  Q  =  \.0.  This  section  has 
been  provided  to  caution  the  reader  in  the  use  of  the  ME/I  hypothesis. 

3. 3. 5. 3  Adding  a  Maximum- Type  Cost  to  the  Controller  Problem.  In  a 
practical  design  problem,  the  maximum  allowable  mean  square  error  may  be  specified 
beforehand.  In  this  case,  it  may  be  desirable  to  add  a  penalty  function  to  the 
quadratic  cost  functional,  as  explained  in  Section  2. 3. 3. 2,  in  order  to  address  this 
specification. 

The  optimized  system  for  Q  =  1.0  has  a  maximum  position  regulation  error 
autocorrelation  of  12.7  m^  at  u.’„  =  62.83  rad/sec.  Suppose  the  designer  specifies 
a  maximum  mean  square  position  regulation  error  of  11.0  m^.  Adding  a  penalty 
function,  like  that  described  in  Section  2. 3. 3. 2,  and  proceeding  with  the  optimization 
produces  the  representative  parameter  set; 

a‘  =  [31.82] 

=  [45.00] 
a^  =  [57.81] 

Compared  to  the  representative  parameter  set  found  with  the  basic  optimization 
algorithm,  this  set  hcis  slightly  higher  values  of  a’,  a^  and  a^,  and  a  little  more  space 
between  a^  and  a^.  An  MMCA  designed  with  this  set  has  a  basic  cost,  of  2.922  m^ 
and  a  maximum  regulation  error  autocorrelation  of  10.99  m^  at  =  62.83  rad/sec 
(see  Figure  44).  This  system  meets  the  maximum  error  specification  at  the  expense 
of  average  performance. 


Figure  44.  Steady  State  Controller  Position  Eiror  Autocorrelation  Using  a  Penalty 
Function 

3.S.5.4  Effect  of  Probability  Lower  Bounds  on  Control.  In  Section  2. .'I. .3. .3, 
the  controller  optimization  algorithm  was  modified  to  include  lower  bounds  on  the 
hypothesis  conditional  probabilities.  Using  Equation  (122)  in  the  controller  opti¬ 
mization  algorithm  enables  us  to  calculate  the  cost  of  a  system  for  any  given  lower 
bound  on  the  probability.  Table  23  shows  how  the  cost  of  the  basic  optimized  solu¬ 
tion  varies  with  a  change  in  the  probability  lower  bound.  As  Pnun  is  increased,  the 
cost  increases,  as  expected.  Note  that  large  values  of  pnun  force  the  MMCA  to  use 
too  much  of  the  input  from  elemental  controllers  that  cannot  maintain  stability,  thus 
making  the  entire  system  unstable. 

The  results  of  controller  simulations  with  various  values  of  are  shown  in 
Table  24.  It  is  evident  from  these  results  that  adding  a  lower  bound  to  the  proba¬ 
bilities  has  the  same  additional  benefit  as  was  shown  in  the  estimator  simulations. 
This  benefit  is  gained  by  allowing  the  MMCA  to  move  off  a  mismatched  filter  once 
it  has  converged,  and  it  cannot  be  predicted  by  the  cost  functional  computations. 


10'^ 


Probability 
Lower  Bound 

m^ 

0.000 

2.8942 

0.001 

2.8943 

0.010 

2.9126 

0.050 

unstable 

0.100 

unstable 

Table  23.  Controller  Cost  Variation  with  Different  Lower  Bounds  on  the  Probabilities 


Quantity  of 
Interest 

pmin 

0.000 

0.001 

0.010 

0.100 

MS  Position 
Regulation  Error 
(meters^) 

2.934 

2.931 

3.032 

unstable 

Table  24.  Simulation  Results  with  Various  Values  of  Pmin 

However,  the  detrimental  effect  predicted  by  the  cost  functional  computations  be¬ 
comes  dominant  at  larger  values  of  Pmim  and  finally  forces  the  system  to  be  unstable. 

3. 3. 6  Summary.  Using  a  vector  minimization  routine  and  an  appropriate  cost 
functional,  an  MMCA  was  designed  which  provides  the  minimum  mean  square  posi¬ 
tion  regulation  error  when  compared  with  other  previously  accepted  design  methods. 
It  was  also  shown  to  provide  an  acceptable  control  system  for  a  problem  in  which 
the  three  geometric  methods  do  not  provide  a  controller  which  is  stable  across  the 
entire  parameter  interval.  This  system  is  shown  in  simulation  to  provide  results  very 
close  to  values  predicted  in  the  design  method. 

Variations  on  the  original  design  method  are  explored  in  Sections  3.3.5. 1  through 
3. 3. 5. 4.  By  manipulating  the  weighting  matrices  in  the  cost  functional  and  control 
gain  design  method,  it  is  possible  to  optimize  the  control  system  to  provide  the 
minimum  mean  square  error  for  linear  combinations  of  the  state  vector  components. 
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It  is  shown  in  Section  3. 3. 5. 2  that  replacing  the  ME/M  hypothesis  with  the  ME/I 
hypothesis  is  not  always  desirable. 

A  penalty  function  can  be  added  to  the  original  controller  optimization  algo¬ 
rithm,  as  shown  in  Section  3. 3. 5. 3,  to  find  systems  with  a  maximum  error  autocor¬ 
relation  within  specifications.  This  decrease  in  maximum  mean  square  regulation 
error  comes  at  the  expense  of  increaised  average  mean  square  regulation  error. 

The  ad  hoc  method  of  placing  a  lower  bound  on  the  probabilities  allows  the 
system  to  adapt  to  changing  parameters.  The  calculations  required  for  the  opti¬ 
mization  algorithm  can  be  augmented  to  take  this  into  account.  The  lower  bound  is 
predicted  to  have  a  detrimental  effect  on  the  estimation  performance.  However,  the 
lower  bound  is  also  shown  in  simulation  to  have  a  beneficial  effect  by  allowing  the 
MMFA  to  move  off  a  mismatched  filter  after  the  MMFA  has  converged  to  that  filter. 
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IV.  Conclusions  and  Recommendations 


4.1  Conclusions 

This  dissertation  is  a  summary  of  a  research  effort  into  the  field  of  adaptive 
estimation  and  control,  using  a  multiple  model  filtering  algorithm  (MMFA),  with 
a  finite  number  of  models,  to  estimate  the  states  of  a  system  with  an  unknown 
parameter  vector  that  can  vary  over  a  continuous  region  of  the  parameter  space. 
Designing  an  MMFA  involves  choosing  a  finite  subset  of  this  uncountably  infinite 
parameter  set,  and  designing  filters  based  on  this  representative  subset.  The  problem 
precipitating  this  research  effort  is  that  previously  accepted  methods  for  choosing  this 
subset  of  parameters  are  ad  hoc  and  provide  no  means  to  measure  the  performance 
of  the  resulting  system.  The  goal  of  this  research  effort  has  been  the  development  of 
an  optimization  procedure  for  choosing  this  discretization  of  the  parameter  space. 

Toward  this  end,  an  algorithm  was  developed  to  find  a  representative  parameter 
set  for  an  MMFA  which  minimizes  a  cost  functional  representing  the  mean  square 
state  prediction  error  (at  t~)  averaged  over  the  admissible  parameter  set.  This 
optimization  algorithm  was  used  to  design  an  MMFA  for  a  simple  second  order 
problem.  Simulations  supported  the  prediction  that  the  representative  parameter 
set  produced  by  the  optimization  algorithm  formed  the  basis  of  an  MMFA  with 
lower  mean  square  state  prediction  error  on  average  than  any  previously  accepted 
method  of  model  selection.  As  anticipated,  this  MMFA  also  exhibited  the  lowest 
average  mean  square  state  estimation  error  (at  tf).  The  algorithm  is  general  for  any 
finite  dimensional  linear  time  invariant  system  and  includes  a  means  to  weight  the 
states  separately  for  cases  in  which  there  is  a  tradeoff  between  states. 

The  number  of  filters,  K,  in  the  MMFA  is  specified  by  the  designer.  It  was 
hoped  that  an  efficient  procedure  could  be  developed  for  choosing  the  number  of 
filters  based  upon  an  allowable  value  of  the  average  mean  square  prediction  error. 
Unfortunately,  the  form  of  the  optimization  algorithm  constrains  this  procedure  to 
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1.  Choose  a  low  value  of  K  for  an  initial  trial. 

2.  Run  the  optimization  algorithm  for  this  K. 

3.  If  the  average  prediction  error  autocorrelation  is  above  the  allowable  value, 

increase  K  and  repeat  step  2. 

4.  If  the  average  prediction  error  autocorrelation  is  acceptable,  stop. 

This  is  hardly  an  efficient  procedure,  but  it  does  allow  the  designer  to  find  the 
minimum  number  of  filters  which  provide  acceptable  performance,  assuming  that 
any  implementable  MMFA  can  provide  acceptable  performance. 

By  replacing  the  state  estimation  cost  functional  with  a  functional  representing 
the  average  mean  square  parameter  estimation  error,  the  basic  optimization  algo¬ 
rithm  was  modified  to  find  a  representative  parameter  set  for  an  MMFA  to  estimate 
the  true  parameter.  The  best  state  estimator  is  not  necessarily  the  best  parameter 
estimator,  as  was  shown  in  Section  3.2.  The  parameter  estimation  optimization  al¬ 
gorithm  found  a  representative  parameter  set,  generally  different  from  the  set  found 
in  the  state  estimator  optimization,  which  provided  lower  mean  square  parameter 
estimat'on  error  on  average  than  any  previously  accepted  method. 

Following  the  basic  development  of  the  estimator  algorithm,  a  separate  al¬ 
gorithm  was  developed  to  find  a  representative  parameter  set  for  a  multiple  model 
control  algorithm  (MMCA),  which  minimizes  a  cost  functional  representing  the  mean 
square  state  regulation  error  averaged  over  the  admissible  parameter  set.  Through 
evaluation  on  a  simple  second  order  problem,  this  algorithm  was  shown  to  find  a 
representative  parameter  set  which  formed  the  basis  of  an  MMCA,  with  lower  mean 
square  regulation  error  on  average  than  any  previously  accepted  design  method. 

The  prediction  error  autocorrelation  is  discontinuous  at  the  points  in  the  pa¬ 
rameter  space  where  the  MMFA  can  switch  from  one  filter  to  the  next.  With  the 
basic  form  of  the  MMFA,  the  estimator  switches  to  a  filter  with  a  larger  predic¬ 
tion  error  autocorrelation  due  to  the  method  of  calculating  the  probabilities  within 
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the  MMFA.  All  three  algorithms  were  modified  to  allow  the  MMFA  to  choose  the 
filter  with  the  lowest  residual  autocorrelation,  by  including  the  ME/I  hypothesis 
as  described  in  Section  2. 1.3.1.  This  wa3  shown  to  be  beneficial  in  the  estimator 
example,  and  to  be  detrimental  in  the  controller  example  with  increased  dynamics 
noise  strength,  Q  =  1.0.  The  benefit  gained  by  making  this  modification  is  problem- 
dependent,  and  should  be  larger  for  problems  in  which  the  residual  autocorrelation, 
M  of  Equation  (2),  changes  dramatically  across  the  admissible  parameter  set. 

It  was  shown  that  the  cost  functionals  used  in  this  dissertation  could  be  aug¬ 
mented,  in  two  different  ways,  to  include  a  cost  on  the  maximum  mean  square  error. 
In  the  estimator  example,  an  additional  cost  proportional  to  the  maximum  mean 
square  prediction  error  was  augmented  to  the  basic  cost  functional  to  produce  sys¬ 
tems  with  lower  maximum  mean  square  errors.  This  decrease  in  maximum  mean 
square  prediction  error  comes  at  the  expense  of  increasing  the  average  mean  square 
prediction  error.  In  the  controller  example,  a  penalty  function  was  added  to  allow 
the  optimization  algorithm  to  find  the  best  MMCA  with  a  specified  maximum  mean 
square  regulation  error. 

Modifications  to  the  basic  algorithms  were  developed  to  take  into  account  the 
ad  hoc  procedure  of  placing  lower  bounds  on  the  probabilities.  These  lower  bounds 
prevent  the  probability  of  any  filter  from  reaching  zero  and  removing  that  filter  from 
further  use  in  the  estimation  process.  For  the  example  problem,  these  modifications 
predicted  that  placing  lower  bounds  on  the  probabilities  would  have  a  detrimental 
effect  on  the  estimator  performance.  However,  this  procedure  also  has  a  beneficial 
effect,  which  manifested  itself  during  simulation,  that  could  not  be  predicted  by  the 
cost  functional  calculations  alone. 

4-2  Contributions 

Perhaps  the  most  useful  contribution  of  this  dissertation  is  the  procedure  for 
calculating  the  cost  functional.  The  cost  functional  computation  was  shown,  in 
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Section  3.1.5,  to  be  a  very  useful  analysis  tool  in  cis  much  a.s  it  accurately  predicts 
the  performance  of  the  MMFA  across  the  parameter  space.  Computing  the  cost 
functional,  and  specifically  producing  plots  like  Figures  16  through  20,  provides  a 
good  mea.sure  of  the  quality  of  any  given  design.  Diflferent  MMFA  designs,  for  the 
same  problem,  can  be  compared  quantitatively,  simply  by  calculating  the  appropri¬ 
ate  cost  functional.  Ba.sed  on  this  information,  one  can  decide  whether  or  not  an 
optimization  is  warranted. 

The  cost  functional  calculations  can  also  form  the  basis  of  an  analysis  of  lower 
order  estimators.  The  algorithm  development  of  Section  2. 1.1.1  includes  a  transfor¬ 
mation  matrix,  T,  so  that  the  estimating  filter  need  not  be  the  same  dimension  as 
the  truth  model.  By  calculacing  the  cost  for  estimators  of  various  dimension,  they 
can  be  quantitatively  compared. 

The  controller  cost  functional  calculations  can  form  the  basis  of  a  control 
method  comparison.  By  calculating  the  cost  functional  for  MMCAs  with  elemental 
controller  gains  calculated  by  various  methods,  such  as  eigenstructure  assignment 
versus  LQG,  these  different  design  methods  can  be  quantitatively  compared. 

The  main  contribution  of  this  dissertation  is  a  formal  method  for  designing 
MMFAs  and  MMCAs  where  none  existed  previously.  This  method  requires  the 
designer  to  specify  the  number  of  elemental  filters  or  controllers  to  be  used,  the 
weighting  matrix  in  the  cost  functional,  and  the  method  of  controller  gain  calculation. 
The  algorithm  can  then  be  automated  to  find  an  optimized  design. 

4-3  Recommendations 

The  control  structure  presented  in  this  dissertation  is  a  multiple  model  reg¬ 
ulator  in  which  the  state  vector  is  regulated  to  zero.  The  modifications  necessary 
for  designing  trackers,  proportional-plus-integral  (PI)  controllers,  or  other  control 
laws  [37,  Chap.  14]  were  not  developed  and  should  be  studied  in  future  research. 
In  most  cases,  this  involves  augmenting  the  system  models  with  additional  shaping 
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filter  states  or  dynamics  states.  This  does  not  significantly  increase  the  complexity 
of  the  optimization  algorithms,  but  incrccises  the  dimension  of  the  problem. 

In  the  preliminary  MMCA  design,  it  was  found  that  the  best  MMCA  had  the 
same  representative  parameter  set  as  the  best  state  estimating  MMFA,  An  analytical 
study  is  recommended  to  see  if  this  is  always  the  case,  or  if  it  has  something  to  do 
with  the  structure  of  the  example  problem. 

Previous  research  [29,  40,  50]  brought  to  light  problems  with  tuning  the  elemen¬ 
tal  controllers  in  an  MMCA  to  increase  robustness  to  unmodelled  disturbances.  This 
tuning  tends  to  confound  the  identification  process,  making  an  argument  against  the 
application  of  loop  transfer  recovery  techniques  [2]  in  elemental  controller  design. 
An  in-depth  study  of  the  concept  of  robustness  as  applied  to  multiple  model  control 
systems  is  necessary  so  that  the  effect  of  robustness  recovery  techniques  on  the  iden¬ 
tification  process  can  be  better  understood.  This  could  possibly  be  tied  in  with  an 
analysis  of  cost  functionals  which  enhance  identifiability,  with  an  end  towards  design¬ 
ing  .MMCAs  which  converge  faster,  as  opposed  to  MMCAs  with  optimum  average 
performance. 
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A  method  is  proposed  for  designing  multiple  model  adaptive 
estimators  to  provide  combined  state  and  parameter  estimation  in  the 
presence  of  an  uncertain  parameter  vector.  It  is  assumed  that  the 
parameter  varies  over  a  continuous  region  and  a  finite  number  of 
constant-gain  filters  is  available  for  the  estimation.  The  estimator 
elemental  filters  are  chosen  by  minimizing  a  cost  functional 
representing  the  average  state  prediction  error  autocorrelation,  with 
the  average  taken  as  the  true  parameter  ranges  over  the  admissible 
parameter  set.  A  second-order  example  is  used  to  illustrate  the 
increase  in  performance  over  previously  accepted  filter  selection 
methods.  By  minimizing  a  cost  functional  representing  the  average 
parameter  prediction  error  autocorrelation,  a  parameter  estimator  is 
designed  which  is  different  from  the  state  estimator.  The  parameter 
estimator  found  with  this  method  provides  lower  average  mean  square 
parameter  estimation  error  than  previously  accepted  design  methods. 

An  analogous  method  is  proposed  for  designing  multiple  model 
adaptive  controllers  to  provide  stabilizing  control  in  the  presence  of 
an  uncertain  parameter  vector.  A  finite  number  of  constant-gain 
controllers  is  used  to  regulate  a  system  with  a  parameter  vector  that 
varies  over  a  continuous  region  of  the  parameter  space.  The  controller 
elemental  filters  are  chosen  by  minimizing  a  cost  functional 
representing  the  average  regulation  error  autocorrelation,  with  the 
average  taken  as  the  true  parameter  ranges  over  the  admissible 
parameter  set.  A  second-order  example  is  used  to  demonstrate  the 
improvement  in  performance  over  previously  accepted  controller 
selection  methods.  Modifications  to  the  estimator  and  controller 
optimization  algorithms  are  investigated  which  provide  the  additional 
capabilities  of:  changing  the  filter  selection  mechanization,  using 
different  types  of  cost  functionals,  and  placing  lower  bounds  on  the 
model  probabilities  to  keep  the  system  open  for  adaptation. 


